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present invention provides a purified and isolated DNA-binding protein, HPBF, which specifically binds to the promoter region 
Ha-2/nezi (ERBB2/c-er£B-2) gene sequence, the presence of which provides an early indication of transition to a cancerous state 
i >und. The present invention also provides bioassays for screening substances for the ability to inhibit HPBF activity, the ability 
he mitogenic activity of HPBF and the ability to inhibit HPBF production* The present invention further provides methods of 
[he biological activity mediated by HPBF comprising preventing the HPBF from binding to the promoter region of the ERBB2 
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ERBB2 PROMOTER BINDING PROTEIN 
IN NEOPLASTIC DISEASE 

BACKGROUND OF THE INVENTION 

5 

FIELD OF THE INVENTION 

The present invention relates generally to the field of medical diagnosis 
and specifically for monitoring the presence of neoplastic diseases at an early stage to 
allow early therapeutic intervention. 

10 

BACKGROUND ART 

Currently, early detection of breast cancer in humans, particularly in 
women, depends on self-examination and mammography. However, routine 
mammography is not recommended for women under 50. Therefore, breast cancers in 
1 5 younger women tend not to be found until more advanced with a correspondingly 
poorer prognosis. Screening methods are needed to identify early stages of the 
transition of normal epithelial cells towards carcinoma in situ before the subsequent 
development of invasive and metastatic cancer. 

20 Breast cancer appears to be genetically and/or morphologically, a 

heterogeneous disease and multiple mechanisms are responsible for the ultimate 
development of breast carcinoma from normal epithelial cells. The Her-2fneu 
(ERBB2/c^riB-2) gene sequence (SEQ ID NO:9), hereinafter referred to as ERBB2, 
appears to be one of the primary genes responsible for the transition of normal epithelial 

25 cells towards carcinoma m situ and the subsequent development of invasive and 
metastatic cancer. However, by the time the gene product of ERBB2 is measurable, 
prognosis is not good. A means of identifying the initiation step for ERBB2 gene 
activity and interfering with that step are necessary for greater success in early 
identification and treatment of breast cancer. 



30 
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Significant progress has been made at the molecular level to dissect the 
role of the ERBB2 gene and its association with breast cancer. However, mechanisms 
that control or initiate the activity of the ERBB2 gene have not been available to give ^ 
early prediction or treatment of breast cancer. The results of some of these molecular 
5 studies are described herein. 



Histologically, breast cancer comprises about 70-85% classified as ductal 
carcinoma; the next largest subgroup is referred to as lobular carcinoma. These two 
major classes of breast cancer comprise more than 80-95% of breast cancer in humans. 

• 10 It has been estimated that 5-15% of breast cancer in women under 50 years of age is 
associated with a genetic propensity for the disease. 143 Several recent studies have 
elucidated some of the inherited mechanisms which are at work in breast cancer. 14 " 17 A 
recent review has described various molecular determinates of growth, angiogenesis and 
metastases which may play a role in breast cancer. In addition, the ERBB2 gene has 

1 5 recently been documented to be prognostically important in breast cancer. 43,45 ' 56 ' 69 

The ERBB2 gene is the human counterpart of the rat neu oncogene 
(SEQ ID NO: 12), originally identified in ethyl iutroso-urea induced rat 
neuroglioblastomas by Weinberg and co-workers. 19,20 The ERBB2 oncogene codes for 

20 a protein of 1 85,000 dalton molecular weight (pi 85 product), and the product is similar 
in overall organization and primary amino acid sequence to the epidermal growth factor 
receptor (EGFR) 21 " 23 A possible ligand for ERBB2 has recently been described. 24 " 26 
The ERBB2 gene is not overexpressed in benign breast tissue, 27 but significantly 
overexpressed in 60% of carcinoma in situ (preneoplastic lesion of breast carcinoma) 

25 and in about 30% of invasive cancer 2W0 

The pi 85 product of the ERBB2 gene is a growth factor receptor with 
intrinsic protein tyrosine kinase activity winch, when deregulated, or disregulated, 
results in unrestrained growth and cell transformation. 32 ' 34 The transforming potential 
30 of the ERBB2 gene is also related to the levels of protein expression. This 

proto-oncogene is also frequently amplified in many human tumors and in cell lines 



WO 95/28485 



PCT/US95/04953 



3 

derived from tumors. 33,35 * 38 ERBB2 gene overexpression in the absence of gene 
amplification has also been described 33,36 " 38 The ERBB2 gene product is a potent 
oncoprotein when overexpressed in NIH-3T3 cells. 34 In a transgenic mouse model 
experiment, transgenic mice were created 39 ' 40 expressing the activated form of the rat 
5 neu proto-oncogene, under the control of steroid inducible promoter, and uniformly 
developed mammary adenocarcinoma. In addition, ERBB2 gene amplification in human 
breast tumor is often associated with poor patient prognosis. 33,3 * The overexpression of 
ERBB2 has also been associated with poor prognosis in non-small cell lung cancer. 41,85 

• 1 0 A convincing body of clinical and experimental evidence thus supports 

the role of ERBB2 protein in the progression of human cancers characterized by the 
overexpression of this oncogene product. Important aspects of this evidence include the 
poor prognosis of breast, ovarian and non-small cell carcinoma patients whose tumors 
overexpress ERBB2 protein, as well as observations which indicate that modulation of 

1 S ERBB2 protein activity by a monoclonal antibody can reverse many of the properties 
associated with tumor progression mediated by growth factor receptor. 42 

A recent study 43 of 209 consecutive female patients with invasive 
operable breast cancer from a defined urban population observed for a median of 30 

20 years demonstrated that fifty-five patients (26%) had cancer and a positive ERBB2 
oncoprotein stain reaction. They had significantly reduced 10 and 25 years survival 
rates as compared with those patients who had a negative stain reaction in their cancer 
(3 1% versus 48% and 3 1% versus 39% respectively with a P value « 0.004). ERBB2 
gene expression was also found to be associated with reduced survival among patients 

25 who had axillary nodal metastases (P value = 0.003) but not among those patients who 

. did not have metastases. EKBB2 expression was related to the ductal histologic type, 
poor histologic grade and high mitotic count, but not to tumor size, axillary nodal status, 
DNA ploidy or S-phase fraction. In a multivariate analysis among patients with nodal 
metastases, ERBB2 expression was found to be an independent prognostic factor (P 

30 value = 0.004) that predicted poor survival. Based on these data, it was concluded that 
ERBB2 oncoprotein expression has long-term prognostic significance for predicting 
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poor survival in breast cancer and it has an independent prognostic value among patients 
who presented with axillary nodal metastases. The mean survival time for the women 
with ERBB2 expressing group is only 29 months compared to the mean survival time of 
1 10 months of the women with nonexpressing cancer. The difference between the 
survival curve is the greatest at approximately five years from the diagnosis (37% versus 
64%) and diminished toward the end of the follow-up, which indicates that ERBB2 
expressing cancers usually progress rapidly and are fetal The result that ERBB2 
expression predicts poor survival is contradictory to the opinion that it could only be a 
marker for drug resistance, 44 not a marker for poor prognosis. 

Overexpression of the ERBB2 oncogene has previously been correlated 
with poor prognosis in patients with infiltrating breast carcinoma. 33 The authors 
reported a 35% difference in survival at four years for node positive patients with 
ERBB2 positive tumors. 33 This finding was emphasized in later studies with large 
15 numbers of patients 45 It appears that the inconsistencies in the relationship between 
ERBB2 overexpression and mammary carcinoma are related to its correlation with 
tumor type. In studies of infiltrating carcinoma, the proportion of tumors showing 
overexpression has ranged from 10-30%; 28 " 30 * 33,4647 in carcinoma in situ, the incidence 
of overexpression is much higher, in the order of 60%. 28 " 30 

20 

Several studies 45,4 wo have clearly shown that there is no loss of ERBB2 
expression when invasive tumors progress from a pure in situ carcinoma. Therefore, 
there must be some other reason why fewer infiltrating tumors overexpress ERBB2. 
The nuclear sizes of the in situ and infiltrating components were also very similar and as 
25 has been found previously for in situ disease, almost all of the ERBB2 positive cases 
contained some large nuclei. A study 51 has suggested that there are at least three groups 
of infiltrating tumors: 



30 



Group 1 - those composed of cells with small nuclei which have arisen 
from small cell cribriform/micropapillary ductal carcinoma in situ. These have a low 
rate of proliferation and of ERBB2 overexpression. 
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Group 2 - tumors composed of large cells which have arisen from large 
cell comedo ductal carcinoma in situ. These have a high rate of proliferation and 
ERBB2 overexpression. 

5 Group 3 - tumors composed of cells with variable nuclear sizes, but 

inducting some large nuclei, over half of which have a high rate of proliferation, but 
none of which overexpress HRBB2. 

The hypothesis is that the latter group of tumors only have a transient in 

10 situ period and quickly become invasive. Because of this rapid progression to invasion, 
these tumors were not found in these studies of pure ductal carcinoma in situ. They 
made only a minor contribution to that study of tumors with a prominent ductal 
carcinoma in situ component accompanied by a variable infiltrating component but have 
become very obvious in this particular study. This could Explain the dilution of overall 

1 5 ERBB2 positivity seen in studies of infiltrating tumors when compared to pure in situ 
tumors. If this is so, h could be accepted that the presence of L*:BB2 overexpression is 
a marker of poor prognosis, since the ERBB2 positive in situ tumors are always 
composed of large cells, usually of comedo pattern and there are data to suggest that 
such tumors have a greater invasive potential than other patterns of in situ carcinoma. 52 " 

20 35 In cases of infiltrating carcinoma, the ERBB2 positive tumors again contain large 
cells and are rapidly proliferating, both factors being associated with a poor prognosis. 
Whereas tumors with small nuclei and tumors with low proliferative activity are nearly 
always ERBB2 negative, there are also significant numbers of ERBB2 negative tumors 
which contain at least some large cells, and many of these tumors have a high rate of 

25 proliferation. As already suggested, it is possible that this group of tumors has only a 
transient in situ stage. 

Finally, another recent study 56 demonstrated that tumors from 16% of 
the node negative patients and 19% of the node positive patients were ERBB2 positive. 
30 In both groups, ERBB2 positively correlated with negative progesterone receptor, 
negative estrogen receptors and high tumor grade. The expression of ERBB2 was 
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prognosticaUy significant for node positive, but not for node negative patients. Tumors 
with overexpression of ERBB2 oncogene were less responsive to cyclophosphamide 
methotrexate and fluorouracil containing adjuvant therapy regimens than those with a 
normal amount of gene product, suggesting worse tumor behavior. For node positive 
5 patients, the effect of prolonged duration therapy on disease free survival was greater 
for patients without ERBB2 overexpression than those with ERBB2 overexpression. 
Similarly, for node negative patients, the effect of perioperative treatment on disease 
free survival was greater for those without ERBB2 overexpression than for those with 
ERBB2 overexpression. 

10 

United States Patents 4,935,341 to Bargmann et a/., issued June 19, 
1990, 4,968,603 to Slamon et al issued November 6, 1990 and 5,183,884 to Kraus et 
al, issued Febmaiy 2, 1993, provide methods relating to the identification of ERBB2 
gene expression, overexpression and prognostic indicators of breast cancer based on the 

1 5 ERBB2 gene product. The Slamon et al "603 patent discloses amplification of the 
ERBB2 oncogene and its relationship to the status of breast and ovarian 
adenocarcinomas. In particular, the degree of gene amplification provides prognostic 
utility for breast cancer. The Bargmann et al 341 patent discloses mutations in the 
ERBB2 gene which result in an oncogenic state and provide an oligonucleotide probe 

20 capable of hybridizing to the mutated region. The Kraus et al. *884 patent discloses a 
DNA fragment distinct from EGFR and the ERBB2 gene, designated as ERBB-3. 
Marked elevation of ERBB-3 mRNA levels were demonstrated in certain human 
mammary tumor cell lines. 

.25 The above research and patents do not provide information that allows 

screening to identify earlier stages of the transition of normal epithelial cells towards 
carcinoma in situ before the subsequent development of invasive and metastatic cancer. 
These results indicate that the ERBB2 gene is extremely important in a significant 
percentage of breast cancers and the regulation of expression is perhaps a key 

3 0 determining factor in breast cancer development and progression. If the regulation can 
be controlled, transition to a cancerous state can be stopped. 
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Recent studies of cloning and characterization of an ERBB2 promoter 
have compared mouse neu promoter (SEQ ID NO: 15) with human ERBB2 promoter. 57 
(SEQ ID NO: 10; SEQ ID NO: 1 1) The presence of CAAT box and lack of a TATAA 
motif is one way in which the mouse neu promoter differs from the human ERBB2 
5 promoter 58 but is similar to the rat neu promoter. 59 (SEQ ID NO: 13; SEQ ID NO: 14) 
The GGA repeats observed between -204 and -184 (with respect to the translational 
start H ATG" codon) of the mouse neu promoter are also seen in rat 59 neu and human 
ERBB2 promoters. 58 A sequence consensus for SP1 is located at -21 1 of the mouse 
neu promoter. SP1 consensus sequences are also seen in rat neu promoter and the 

10 human ERBB2 promoter in an analogous regioa The sequence GCCGCCGC at -140 in 
the mouse neu promoter is similar to the binding site for G-CSF 60 and is also observed in 
the rat neu promoter but not in the human ERBB2 promoter. A sequence similar to the 
OTF 1 motif, 61,62 but differing by one nucleotide (ATGCAAAC instead of 
ATGCAAAT), is located at position -462. A similar sequence is also seen in the rat neu 

IS promoter and human ERBB2 promoters at equivalent positions. Sequences with 
homology to the AP2 consensus sequence (T/CC/GC/GCCA/CNG/CC/GG/Q 63 are 
located at -328 and -106 of the mouse neu promoter gene; similar sequences are also 
found in the corresponding regions of the rat neu promoter and human ERBB2 
promoter. 

20 

A novel transcription factor termed "KNF" 64 was found to bind to the 
promoter of the rat neu gene. The binding sequence for this factor is also present in 
both the mouse (-439) neu promoter and human ERBB2 promoter. The 
GGTGGGGGGG sequence, termed "GTG n enhancer, which is involved in 

25 autorepression of the rat neu transcription 59 is located at position -249 to -240 in the 
mouse neu promoter. However, the corresponding region of the human ERBB2 
promoter is different Conservation of transcription factor sequences among these three 
species may imply a conserved function. It is not known at the present time whether 
those sequences that are different between rodent and human genes such as CAAT and 

30 TATAA box, GTG enhancer and other motifs might represent species specific functions. 
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This information, together with the feet that multiple transcriptional 
initiation sites are mapped in both the rat neu and human ERBB2 genes, makes h likely 
that the TATAA sequence in the human ERBB2 promoter does not function as a 
transcriptional TATAA box. The previous studies on rat neu and human ERBB2 
5 promoters focused mainly on a region within 1 Kb upstream from the transcriptional 
initiation sites. The current studies on the mouse neu promoter 57 have lead to 
identification of a silencer region approximately three Kb upstream from the 
transcriptional initiation site, similar sequences have not yet been reported in human 
ERBB2 promoter An estrogen responsive region has been found within the rat neu 
10 promoter region. 70 

It has been reported that the expression of the ERBB2 gene is tissue 
specific and developmentally regulated. 65 Transcriptional regulation, therefore, may be 
one of the mechanisms (fector) leading to overexpression Of ERBB2 gene in human 

1 5 cancer cells. Therefore, regardless of the relative distances from the transcriptional 
initiation site, identification of silencer and enhancer sequences controlling ERBB2 
transcription provides important information that may allow clinical information to be 
obtained for studying transcriptional mechanisms resulting in cancer and understanding 
the biological role of ERBB2 gene regulation in breast cancer development, 

20 heterogeneity, progression and recurrence. 

Primary gene induction or repression in eukaryotes does not require de 
novo protein synthesis, suggesting the involvement of post-translational modifications as 
well. In a recent review, 67 it was summarized that many different types of stimuli that 
.25 affect gene expression also led to the activation of protein kinases; it is likely that 

transcription fector function will be directly regulated by phosphorylation. Even though 
other types of post-translational modifications will undoubtedly be important in 
regulating transcription factor function, phosphorylation seems to be one of the most 
important functions which has been studied recently. 67 " 68 

30 

In summary, first, a transcription factor can be sequestered in the 
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cytoplasm and rendered inactive thr ugh lack of access to the target sequences. 
Phosphorylation of the factor itself, or a cytoplasmic anchor protein allows translocation 
of the transcription factor into the nucleus, where it acts, generally by binding to the 
DNA at a specific site by protein-DNA interactioa 73 Second, the DNA-binding activity 
5 of nuclear transcription factor can be modulated by phosphoiylation either posittveiy or 
negatively. 67 " 68 Third, phosphorylation can affect the interaction of transcription factor 
transactivation domains with the transcriptional machinery. 67 " 68 These possibilities are 
by no means mutually exclusive and in principle phosphorylation at multiple sites by 
different protein kinases can result in regulation at several distinct levels. Nuclear 
10 translocation of various transcription factors modulated by phosphorylation has been 
demonstrated recently. 72 



It has been shown that in unstimulated cells, with the notable exception 
of B cells, NFkB (nuclear factor kB) is retained in the cytoplasm in an inactive complex 

1 5 with the intermediary protein (IkB), which cannot bind DNA. 73 ' 74 In response to 
various stimuli, including the phorbol-ester TP A, the IkB-NFkB complex dissociates 
and NFkB DNA-binding activity is detected in the nucleus. 73 DNA binding activity can 
be revealed in unstimulated cytoplasmic extracts by a number of means including 
treatment with sodium deoxycholate, which dissociates the IkB-NFkB complex. 74 

20 Therefore, there is much evidence to suggest that a transcription factor can be found in 
the cytoplasmic extracts, as well as in the nuclear extract. 67 A 
phosphorylation-dephosphorylation mechanism for the translocation of transcription 
factor in numerous systems by protein kinase A and protein kinase C has been 
demonstrated as indicated earlier. Almost every eukaryotic transcription factor that 

25 has been analyzed in detail has proved to be phosphorated. In most cases, however, 
the functional consequences of such phosphorylations, if any, are largely unknown. 

There are only a few possible mechanisms proposed for the regulation of 
ERBB2 gene expression which are summarized as follows: 
30 (0 A recent report has suggested that the E3 region of adenovirus induces down 

regulation of epidermal growth factor receptor. A similar repression of ERBB2 
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expression has also been documented, however, the repressed expression of ERBB2 is 
not through the E3 region of the adenovirus. The repression of ERBB2 expression is 
accomplished by E1A gene product, and it specifically repressed ERBB2 gene 
expression at the RNA level 75 and full basal promoter activity of ERBB2 gene has been 
5 shown to be retained by two fragments of the ERBB2 5* region (-759 to-724 and -396 
to -24 base pair). 

(if) Functional inactivation of both alleles of the retinoblastoma susceptibility 
gene (RB) plays an important role in the etiology of both sporadic and familial 
retinoblastomas and several other types of human cancers, including breast cancer. 76 * 77 

10 The RB gene may have cell cycle control function. 78 ' 79 RB protein function may vary 
during the cell cycle because it shows cell cycle dependent changes in phosphorylation 
and RB protein can be phosphorated by the cell cycle kinase p34 cdc2. 80 RB protein 
can also complex with the transcription factor E2F and inhibit E2F binding to the 
promoters of several cellular proliferation related genes. 81 Recent studies revealed that 

15 RB protein can negatively regulate the immediate early genes of c-fos and c-myc 
expression at the transcriptional level in NIH-3T3 cells. 82,83 RB also stimulates the 
growth inhibitory factor TGF-P 1 expression in certain cell types and subsequently 
suppresses cell growth. 84 Taken together, all of these results suggest that RB may limit 
the progression of cells through the cell cycle by sequestering a variety of nuclear 

20 proteins involved in growth regulatory gene transcription. As indicated earlier the 
amplification and overexpression of ERBB2 is involved in human breast and lung 
cancers. 38 ' 85 Interestingly, inactivation of the RB gene has also been implicated in the 
oncogenesis of human breast and lung cancers 77 ' 86 and may suggest the possible 
molecular link between RB and the ERBB2 gene in the development and progression of 

25 breast cancer. A recent study has shown that the RB protein can bind specifically with a 
GTG-GGGGGGG sequence in the ERBB2 promoter and suppress the promoter 
function. This study has concluded that the RB protein suppresses ERBB2 induced 
transformation by suppressing the ERBB2 promoter activity: 87 

(///) An interesting feature of the human ERBB2 gene promoter is the presence 

30 of two different types of regulatory elements: a CAAT box and SP1 binding sites. 

Transcription from the three most downstream RNA start sites appear to be controlled 
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by the CAAT box and the TATA box, because these are respectively about 30 bp and 
80 bp upstream of the early start sites and these distances are consistent with those in 

Oft 

many other eukaryotic promoters. On the other hand, transcription from the fourth 
RNA start sites located further upstream seems to be controlled at least partly by SP1 . 
5 In contrast with the ERBB2 gene promoter, the promoter region of the human 

epidermal growth factor receptor (EGER) gene does not contain either a TATA box or 
a CAAT box but has 5 SP1 binding sites. Therefore, the expression of the ERBB2 gene 
may be regulated by the transcription factor SP1, a CAAT box binding protein and a 
TATA box binding protein, 89 ** 1 whereas the expression of the EGFR gene serais to be 
1 0 regulated by SP 1 but not by the latter two proteins. 

Since the ERBB2 gene appears to be important in breast cancer, 
treatment modalities have been reported in the literature employing strategies which 
target this gene. A recent report 71 used a monoclonal antibody coupled to a toxin to 
IS target the extracellular domains of the ERBB2 receptor protein which are overexpressed 
on human breast and ovarian tumor cells in vitro. However, this is again late in the 
stage of the transition of normal epithelial cells to cancer. As described earlier, ERBB2 
expressing cancers usually progress rapidly and are fatal. Treatment and diagnosis needs 
to be at an earlier stage, while the cells are still only showing hyperplasia. 

20 

SUMMARY OF THE INVENTION 

The present invention provides a purified and isolated DNA-binding 
protein which specifically binds to the promoter region of the c-eriB-2 gene sequence 
25 (Her-2/wez/ promoter binding factor: HPBF). 

The present invention also provides antibodies which specifically bind 
HPBF. The present invention further provides a bioassay for determining the amount of 
HPBF in a biological sample comprising contacting the biological sample with a nucleic 
30 acid or antibody to which the HPBF binds under conditions such that an HPBF/nucleic 
acid complex or an HPBF/anttbody complex can be formed and determining the amount 
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of the complex, the amount of the complex indicating the amount of HPBF in the 
sample. 



The present invention also provides a method of detecting the presence 
5 of a cancer in a subject and determining the prognosis of a subject having cancer 
comprising determining the presence of a detectable amount of HPBF in a biopsy from 
the subject, the presence of a detectable amount of HPBF, relative to the absence of 
HPBF in a normal control indicating the presence of cancer and a decreased chance of 
long-term survival. 

10 

The present invention further provides a DNA isolate encoding HPBF. 

In addition, the present invention provides a bioassay for screening 
substances for ability to inhibit the activity of HPBF comprising administering the 
15 substance to a cell construct comprising the promoter region of ERBB2 linked to a 
reporter gene and an activated gene encoding HPBF and determining the amount of the 
reporter gene product and selecting those substances which inhibit the expression of the 
reporter gene product 

20 The present invention also provides a bioassay for screening substances 

for the ability to inhibit the mitogenic activity of HPBF in MH3T3 cells comprising 
administering the substance to the cells, administering HPBF to the cells, determining 
the mitogenic activity of HPBF in the substance-treated cells and selecting those 
substances which inhibit the mitogenic activity of HPBF in the cells. 

25 

The present invention further provides a bioassay for screening 
substances for the ability to the inhibit the production of HPBF comprising administering 
the substance to a cell having an activated gene encoding HPBF and determining the 
amount of HPBF produced and selecting those substances which inhibit the production 
30 of HPBF. 
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Finally, the pr.:vsnt invents provides a method of inhibiting a biological 
activity mediated by HPBF comprising preventing the HPBF from binding to the 
promoter region of the ERBB2 gene sequence wherein the binding to the promoter 
region is prevented by an antisense nucleotide sequence or wherein the binding to the 
5 promoter region is prevented by a nongenomic nucleic add sequence to which the 
HPBF binds. 



10 



BRIEF DESCRIPTION OF THE DRAWINGS 



15 



20 



Other advantages of the present invention will be readily appreciated as 
the same becomes better understood by reference to the following detailed description 
when considered in connection with the accompanying drawings wherein: 



region includin. 
boxes. The pi 
relative to the 



sepharose resin 



T*E 1 is a representation of a partial physical map of ERBB2 5' 
remoter area, where sev.?-*! binding factors are indicated in black 
which is the immediate : rioter region, spans - 22 to + 9 
nscription start site in thr . 3B2 promoter. 

"RE 2 presents the strategy used to construct specific DNA- 
g double stranded oligonucleotide (probe B). 



25 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 



30 



The present invention may be understood more readily by reference to 
the following detailed description of specific embodiments and the Examples and 
Figures included therein. 
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According to the present invention, a purified and isolated DNA-binding 
factor which specifically binds to the promoter region of the ERBB2 gene sequence 
(Her-2/neu promoter binding factor: HPBF) has been found, as detailed in Examples 1-4 
here below. (The factor has also been designated herein as ERBB2 promotor binding 
5 protein: EPBP and as Tumor Enhancer Factor: TEF.) The factor was determined to be 
a protein as detailed in Example 5 below. The protein includes a peptide generated by 
asp-N digest with an N-terminal ten amino add sequence of Aspartic Acid-Glycine- 
Aspartic acid-Asparagme-Phenylalaiune-Prolme^ 

(SEQ ID NO: 1) as detailed in Example 8 here below. Further, the protein includes a 
1 0 peptide generated by cyanogen bromide cleavage with an N-terminal ten amino add 
sequence of Lysine- Isoleucine- Alanine- Isoleucine- Glutamic add- Alanine- Glycine- 
Tyrosine- Aspartic add- Phenylalanine (SEQ ID NO:2) as detailed in Example 8 here 
bdow. 

15 The isolated protein has a molecular wdght of about 44,000-47,000 

daltons as measured by SDS-PAGE. Further the protein binds specifically to a double 
stranded-DNA (ds-DNA) probe of sense and anti-sense oligonucleotides having the 
sense sequence: 

5' — TAC-GAATGAAGTf GTGAAGCTGAGATTCCCCTC 
20 C— 3* (SEQ ID NO:3) and the anti-sense sequence 

y CTT AC TTC AAC ACTTCGACTCT A AGGGGAGG- 

C A T— 5' (SEQ ID NO:4), as detailed in Example 7 below. Microinjection into NB- 
3T3 cells of the purified protein causes the induction of DNA synthesis in quiescent 
NIH-3T3 cells, as detailed in Example 9 below. 

25 

The DNA-binding protein (HPBF) is purified and isolated from tumor 
tissues using a ds-DNA probe of sense and anti-sense oligonucleotides having the sense 
sequence: 

5' — TAC-G A ATGA AGTTGTG AAGCTGAGATTC CCCTC 
30 C~3' (SEQ ID NO:3) and the anti-sense sequence 

y CTTACTTC AAC ACTTCGACTCT AAGGGGAGG- 
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C A T— 5* (SEQ ID NO:4) as more fully detailed in Example 6. 

This DNA-binding protein has been detected at high concentrations in 
samples of adenocarcinoma-admixed with carcinoma in situ of the breast, whereas the 
5 apparently benign breast tissue from the same quadrant area shows very minimal (almost 
unidentifiable) presence of this protein, and has also been found in the sera of patients 
with breast cancer, as detailed in Examples 2, 3 and 10. These studies indicate that this 
DNA-binding protein is specifically interacting with the promoter region of the ERBB2 
gene during the transition of normal epithelial cells towards carcinoma in situ and 
10 subsequently to the development of invasive breast carcinoma and the protein is soluble 
and excreted into the serum. The protein, therefore, provides an earlier indication of 
transition to a cancerous state than the gene product of the ERBB2 gene itself 

The present invention also provides an antibody that is specifically 
15 reactive with HPBF. "Specifically reactive," as used herein describes an antibody or 
other ligand that specifically binds the HPBF protein and does not crossreact 
substantially with any antigen other than the HPBF protein. Antibody can include 
antibody fragments such as Fab fragments which retain the binding activity. 

20 The antibody can be bound to a solid support substrate or conjugated 

with a detectable moiety or therapeutic compound or both bound and conjugated. Such 
conjugation techniques are well known in the art. For example, conjugation of 
fluorescent or enzymatic moieties can be performed as described in Johnstone & 
Thorpe, Immunochemistry m Practice, BladcweD Scientific Publications, Oxford, 1982. 

25 

The binding of antibodies to a solid support substrate is also well known 
in the art. (See, for example, Harlow and Lane, Antibodies; A Laboratory Manual, . . 
Cold Spring, Harbor Laboratory, Cold Spring Harbor, New York, 1988). The 
detectable moieties contemplated with the present invention can include fluorescent, 
30 enzymatic and radioactive markers. Therapeutic drugs contemplated with the present 
invention can include cytotoxic moieties such as ricin A chain, diphtheria toxin and 
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detectabl moieties contemplated with the present invention can include fluorescent, 
enzymatic and radioactive markers. Therapeutic drugs contemplated with the present 
invention can include cytotoxic moieties such as ricin A chain, diphtheria toxin and 
chemotherapeutic compounds. Such therapeutic drugs can be utilized for killing cancer 
5 cells expressing HPBF. 

Immunoassays 

Immunoassays such as immunofluorescence assays, radioimmunoassays 
(RIA), immunoblotting and enzyme linked immunosorbent assays (EUSA) can be 

10 readily adapted to accomplish the detection of HBPF. In general, ELISAs are the 
preferred immunoassays employed to assess the amount of HBPF in a specimen. Both 
polyclonal and monoclonal antibodies can be used in the assays. An FT ISA method 
effective for the detection of HBPF protein can, for example, be as follows: (1) bind the 
antibody to a substrate; (2) contact the bound antibody with a fluid or tissue sample 

1 5 containing. the antigen; (3) contact the above with secondary antibody bound to a 
detectable moiety (e.g., horseradish peroxidase enzyme or alkaline phosphatase 
enzyme); (4) contact the above with the substrate for the enzyme; (5) contact the above 
with a color reagent; and (6) observe color change. Available immunoassays are 
extensively described in the patent scientific literature. See, for example, United States 

20 Patents 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 
3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; and 4,098,876. 

Bioassavs for Determining the Amount of 
HPBF in a Biological Sample 
25 The present invention provides a method of determining the amount of 

HPBF in a biological sample comprising the steps of contacting the biological sample 
with a substance which binds HPBF under conditions such that a complex between . 
HPBF and the substance can be formed and determining the amount of the complex, the 
amount of complex indicating the amount of HPBF in the sample. 

30 

As contemplated herein, a biological sample includes any body fluid 
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which would contain the HPBF protein, such as blood, plasma, serum, and urineor any 
cell containing the HPBF protein. Examples of cells include tissues taken from surgical 
biopsies or isolated from a body fluid. 

5 One example of the method of determining the amount of HPBF in a 

biological sample is performed by contacting the biological sample with a nucleic add 
which binds HPBF under conditions to form a complex and determining the amount of 
HPBF/nucleic acid complex, the amount of the complex indicates the amount of HPBF 
in the sample. Nucleic acid sequences which bind HPBF to form a complex can be 
10 identified as described hereon in the Examples. For example, the nucleic acid sequence 
ofSEQ ID NO;3 binds HPBF as described herein. 

Determination of the amount of HPBF/nucleic acid complex can be 
accomplished through techniques standard in the art For example; the complex may be 
IS precipitated out of a solution or detected by the addition of a detectable moiety 

conjugated to the nucleic acid, as described, for example in Sambrook et aL t Molecular 
Cloning, A Laboratory Manual, Cold Springs Harbor, New York, 1989). 

Another example of the method of determining the amount of HPBF in a 
20 biological sample is performed by contacting the biological sample with an antibody 
against HPBF under conditions such that a specific complex of an antibody and HPBF 
can be formed and determining the amount ofHPBF/antibody complex, the amount of 
the complex indicating the amount of HPBF in the biological sample. Antibodies which 
bind HPBF can be either monoclonal or polyclonal antibodies and can be obtained as 
25 described herein in the Examples. Determination of HPBF/airtibody complexes can be 
accomplished using the immunoassays as described herein in the Examples. 

The present invention also provides a method of detecting the presence 
of a cancer in a subject comprising determining the presence of a detectable amount of . • 
30 HPBF in a biopsy from the subject, the presence of a detectable amount of HPBF, 

relative to the absence of HPBF in a normal control, indicating the presence of a cancer. 
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The method of determining the presence of a detectable amount of HPBF in a biopsy 
from the subject comprises the methods of determining the amount of HPBF in a 
biological sample as described herein in the Examples. As used herein, "biopsy" means 
any body fluids or cells which may contain HPBF which have been removed from the 
5 subject suspected of having a cancer. Also, as used herein, "detectable amount" means 
any amount of HPBF which is detectable by the methods of detection of HPBF 
described herein, as compared to the absence of a detectable amount of HPBF in a 
normal control biopsy taken from the same subject. When a normal biopsy sample and a 
suspected cancerous biopsy sample are removed from the same subject, any amount of 
10 HPBF present in the suspected sample, in greater quantities than an amount of HPBF 
detected in a normal sample, is considered a detectable amount. A detectable amount of 
HPBF is indicative of the presence of cancer, based on results of numerous studies as 
cited herein. 

15 The present invention further provides a method of determining the 

prognosis of a subject having cancer comprising determining the presence of a 
detectable amount of HPBF in a biopsy from the subject, the presence of a detectable 
amount of HPBF, relative to the absence of HPBF in a normal control indicating a 
decreased chance of long-term survival. A detectable amount of HPBF is indicative of 

20 decreased chance of long-term survival based on the statistical correlations as described 
herein. 

Isolation of DNA Encoding HPBF 
The present invention provides an isolated nucleic add encoding HPBF. 
25 By "isolated" is meant separated from other nucleic adds found in humans. The nucleic 
add encoding HPBF is specific for humans expressing HPBF. By "specific" is meant an 
isolated sequence which does not hybridize with other nucldc adds to prevent an 
adequate hybridization with the nucleic acid encoding HPBF. 

30 The isolated nucleic add encoding HPBF can be obtained by standard 

methods well known in the art. For example, a library of cDNA clones can be generated 
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and expressed in K coti bacteria. Specific clones expressing HPBF or fragments thereof 
can be screened on colony blots using antibodies against HPBF generated as described 
in the Examples herein. Positive clones can then be sequenced by standard methods and 
the entire genes sequence of HPBF can be determined. (See, Sambrook et aL 9 
5 Molecular Cloning, A Laboratory Manual, Cold Springs Harbor, New York, 1989). 



Also provided is an isolated nucleic add that selectively hybridizes with 
the nucleic add encoding HPBF under stringent conditions and has at least 70% and 
more preferably 80% and 90% complementarity with the segment and strand of the 

10 nucleic add of HPBF to which it hybridizes. As used herein to describe nucleic adds 
the term "selectively hybridizes" excludes the occasional randomly hybridizing nucleic 
adds as well as nucldc acids that encode other known promoter binding factors. 
Because the HPBF-encoding nucldc add is double stranded, the selectively hybridizing 
nudeic add can hybridize with either strand when the two strands of the coding 

IS sequence are not hybridized to each other. The selectively hybridizing nucleic adds can 
be used, for example, as probes or primers for detecting the presence of a sample that 
has a nuddc add to which it hybridizes. Alternatively, the nucldc add can encode a 
segment of the HPBF protein. The conditions of hybridization are stringent, but may 
vary depending on the length of the nucleic acids. 

20 

Modifications to the nuddc adds of the invention are also contemplated 
as long as the essential structure and function of the polypeptide encoded by the nuddc 
adds are maintained. Likewise, fragments used as primers or probes can have 
substitutions as long as enough complementary bases exist for selective hybridization 
25 (Kunkd et al, Methods EnzymoL, 154:367 (1987)). 



Bioassavs 

The present invention provides a bioassay for screening substances for 
their ability to inhibit the activity of HPBF. Briefly, this can be accomplished by 
30 cotransfection assays whereby a plasmid containing a promoter gene, such as the 

bacterial chloramphoiicolacetyltransferase (CAT) gene, doned directly downstream of 
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the ERBB2 promoter, can be cotransfected into a cultured cell line, such as COS7 cells, 
with a second plasmid which has a promoter known to be active in the cultured cells, 
cloned directly upstream of the HPBF gene. In such an assay, the HPBF gene encoding 
the HPBF transcript will be transcribing HPBF messenger SNA which will then be 
5 translated into HPBF protein. The HPBF protein then will be activating transcription of 
the reporter gene through its interaction with the ERBB2 promoter. The products of 
the reporter gene transcripts can then be quantitated. Such techniques for cotransfection 
and detection of CAT gene products in cultured cell lines are very well known in the 
art 98 " 101 . A cotransfected cell culture can then be contacted with compounds to screen 
10 than for the ability to inhibit the activity of HPBF. A compound which inhibits the 
activity of HPBF will inhibit the interaction of HPBF with the ERBB2 promoter. This 
decreased interaction is quantifiable by monitoring the CAT enzyme produced as a result 
of transcription directed by the ERBB2 promoter. 

1 5 The present invention also provides a bioassay for screening substances 

for the ability to inhibit the mitogenic activity of HPBF in cultured NDDT3 cells. 
NIH3T3 cells are highly sensitive to sarcoma virus formation and HPBF is known to 
produce mitogenic effect when introduced into these cells 102,103 . Briefly, quiescent 
NIH3T3 cultured cells are microinjected with HPBF and observed for any mitogenic 

20 effect, such as the formation of morphologically recognizable foci (cells no longer 
growing in an organized manner and as a monolayer, but contact inhibited and 
disorganized, eventually growing in disorganized multiple layers). Alternatively, DNA 
synthesis levels can be monitored both pre and post-injection as a direct measure of 
changes in genome replication 103 , 

25 

Using this mitogenic assay, one can screen substances for their ability to 
inhibit the known mitogenic activity of HPBF. Such substances can be co-injected into 
quiescent MH3T3 cells with HPBF and the mitogenic activity can then be compared to 
the mitogenic activity of HPBF or such substance injected alone. One can then readily 
30 determine whether a substance has an inhibitory effect on the mitogenic activity of 
HPBF. 
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Inhibition of Biological Activity of HFRF 
The present invention provides a method of inhibiting a biological activity 
mediated by HPBF comprising preventing the HPBF from binding to the promoter 
region of the ERBB2 gene sequence. 

5 

In one example, the present invention provides a method of inhibiting a 
biological activity mediated by HPBF comprising preventing the HPBF from binding to 
the promoter region of the ERBB2 gene sequence wherein the binding to the promoter 
region is prevented by an antisense nucleotide sequence. The antisense oligonucleotide 
10 can be generated using well known nucleic acid synthesis methods as demonstrated in 
the Examples. 

In another example, the present invention provides a method of inhibiting 
a biological activity mediated by HPBF comprising preventing the HPBF from binding 
15 to the promoter region of the ERBB2 gene sequence wherein the binding to the 
promoter region is prevented by a nongenomic nucleic acid sequence to which the 
HPBF binds. 

A method to inhibit a biological activity of HPBF and decrease ERBB2 
20 activity can use antisense or triplex oligonucleotide analogues or expression constructs. 
This entails introducing into the cell a nucleic acid sufficiently complementary in 
sequence so as to selectively hybridize to the target gene or message. Triplex inhibition 
relies on the transcriptional inhibition of the target gene and can be extremely efficient 
. since only a few copies per cell are required to achieve complete inhibition. Antisense 
25 methodology on the other hand inhibits the normal processing, translation or half-life of 
the target message. Such methods are well known to one skilled in the art. 

Although longer sequences can be used to achieve inhibition, antisense 
and triplex methods generally involve the treatment of cells or tissues with a relatively 
30 short oligonucleotide. The oligonucleotide can be either deoxyribo- or ribonucleic add 
and must be of sufficient length to form a stable duplex or triplex with the target RNA 
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or DNA at physiological temperatures and salt concentrations. It should also be of 
sufficient complementarity selectively hybridize to the target nucleic acid. 
Oligonucleotide lengths sufficient to achieve this specificity are generally about 12 to 60 
nucleotides long, preferably about 18 to 32 nucleotides long. In addition to length, 
5 hybridization specificity is also influenced by GC content and primary sequence of the 
oligonucleotide. Such principles are well known in the art and can be routinely 
determined by one who is skilled in the art. 

The composition of the antisense or triplex oligonucleotides can also 
10 influence the efficiency of inhibition. For example, it is preferable to use 

oligonucleotides that are resistant to degradation by the action of endogenous nucleases. 
Nuclease resistance will confer a longer in vivo half-life onto the oligonucleotide and 
therefore increase its efficacy by reducing the required dose. Greater efficacy can also 
be obtained by modifying the oligonucleotide so that it is more permeable to cell 
1 5 membranes. Such modifications are well known in the art and include the alteration of 
the negatively charged phosphate backbone of the oligonucleotide to uncharged atoms 
such as sulfur and carbon. Specific examples of such modifications include 
oligonucleotides that contain methylphosphonate and thiophosphonate moieties in place 
of phosphate. These modified oligonucleotides can be applied directly to the cells or 
20 tissues to achieve entry into the cells and inhibition of HPBf activity. Other types of 
modifications exist as well and are known to one skilled in the art 

Recombinant methods known in the art can also be used to achieve the 
antisense or triplex inhibition of a target nucleic acid. For example, vectors containing 

25 antisense nucleic acids can be employed to express protein or antisense message to 
reduce the expression of the target nucleic add and therefore its activity. Such vectors 
are known or can be constructed by those skilled in the art and should contain all 
expression elements necessary to achieve the desired transcription of the antisense or 
triplex sequences. Other beneficial characteristics can also be contained within the 

30 vectors such as mechanisms for recovery of the nucleic acids in a different form. 

Phagemids are a specific example of such beneficial vectors because they can be used 
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either as plasmids or as bacteriophage vectors. Examples of other vectors include 
viruses such as bacteriophages, baculoviruses and retroviruses, DNA viruses, cosmids, 
plasmids, liposomes and other recombination vectors. The vectors can also contain 
elements for use in either procaryotic or eucaryotic host systems. One of ordinary skill 
5 in the art will know which host systems are compatible with a particular vector. 

The vectors can be introduced into cells or tissues by any one of a variety 
of known methods within the art. Such methods can be found described in Sambrook et 
aL, Molecular Cloning: A Laboratory Manual, Cold Springs Harbor Laboratory, New 

10 York (1992), in Ausubel et.al., Current Protocols in Molecular Biology, John Wiley 
and Sons, Baltimore, Maryland (1989), and include, for example, stable or transient 
transfection, lipofection, electroporation and infection with recombinant viral vectors. 
Introduction of nucleic acids by infection offers several advantages over the other listed 
methods. Higher efficiency can be obtained due to their infectious nature. Moreover, 

1 5 viruses are very specialized and typically infect and propagate in specific cell types. 
Thus, their natural specificity can be used to target the antisense vectors to specific cell 
types in vivo or within a tissue or mixed culture of cells. Viral vectors can also be 
modified with specific receptors or ligands to alter target specificity through receptor 
mediated events. 

20 

A specific example of a DNA viral vector for introducing and expressing 
antisense nucleic adds is the adenovirus derived vector Adenop53TK. This vector 
expresses a herpes virus thymidine kinase (TK) gene for either positive or negative 
selection and an expression cassette for desired recombinant sequences such as antisense 
25 sequences. This vector can be used to infect cells that have an adenovirus receptor 

which includes most cancers of epithelial origin as well as others. This vector as well as 
others that exhibit similar desired functions can be used to treat a mixed population of 
cells can include, for example, an in vitro or ex.vivo culture of cells, a tissue or a human 
subject. 

30 



Additional features can be added to the vector to ensure its safety and/or 



WO 95/28485 



PCT/US95/04953 



24 

enhance its therapeutic efficacy. Such features include, for example, markers that can be 
used to negatively select against cells infected with the recombinant virus. An example 
of such a negative selection marker is the TK gene described above that confers 
sensitivity to the antibiotic gancyclovir. Negative selection is therefore a means by 
5 which infection can be controlled because it provides inducible suicide through the 
addition of antibiotic. Such protection ensures that i£ for example, mutations arise that 
produce altered forms of the viral vector or antisense sequence, cellular transformation 
will not occur. Features that limit expression to particular cell types can also be 
included. Such features include, for example, promoter and regulatory elements that are 
10 specific for the desired cell type. 

Recombinant viral vectors are another example of vectors useful for in 
vivo expression of a desired nucleic acid because they offer advantages such as lateral 
infection and targeting specificity. Lateral infection is inherent in the life cycle of; for 

1 S example, retrovirus and is the process by which a single infected cell produces many 
progeny virions that bud off and infect neighboring cells. The result is that a large area 
becomes rapidly infected, most of which were not initially infected by the original viral 
particles. This is in contrast to vertical-type of infection in which the infectious agent 
spreads only through daughter progeny. Viral vectors can also be produced that are 

20 unable to spread laterally. This characteristic can be useful if the desired purpose is to 
introduce a specified gene into only a localized number of targeted cells. 

As described above, viruses are very specialized infectious agents that 
have evolved, in many cases, to elude host defense mechanisms. Typically, viruses 

25 infect and propagate in specific cell types. The targeting specificity of viral vectors 
utilizes its natural specificity to specifically target predetermined cell types and thereby 
introduce a recombinant gene into the infected cell. The vector to be used in the 
methods of the invention will depend on desired cell type to be targeted. For example, if 
breast cancer is to be treated by decreasing the HPBF activity of cells affected by the 

30 disease, then a vector specific for such epithelial cells should be used. Likewise, if 

diseases or pathological conditions of the hematopoietic system are to be treated, then a 
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viral vector that is specific for blood cells and their precursors, preferably for the specific 
type of hematopoietic cell, should be used. 

Retroviral vectors can be constructed to function either as infectious 
5 particles or to undergo only a single initial round of infection. In the former case, the 
genome of the virus is modified so that it maintains all the necessary genes, regulatory 
sequences and packaging signals to synthesize new viral proteins and RNA Once these 
molecules are synthesized, the host cell packages the RNA into new viral particles which 
are capable of undergoing further rounds of infection. The vector's genome is also 

10 engineered to encode and express the desired recombinant gene. In the case of non- 
infectious viral vectors, the vector genome is usually mutated to destroy the viral 
packaging signal that is required to encapsulate the RNA into viral particles. Without 
such a signal, any particles that are formed will not contain a genome and therefore 
cannot proceed though subsequent rounds of infection. The specific type of vector will 

IS depend upon the intended application. The actual vectors are also known and readily 
available within the art or can be constructed by one skilled in the art using well-known 
methodology. 

HPBF antisense-encoding viral vectors can be administered in several 
20 ways to obtain expression and therefore decrease the activity of HPBF in cells affected 
by the disease or pathological condition. If viral vectors are used, for example, the 
procedure can take advantage of their target specificity and consequently, do not have 
to be administered locally at the diseased ate. However, local administration can 
provide a quicker and more effective treatment, administration can also be performed 
25 by, for example, intravenous or subcutaneous injection into the subject . Injection of the 
viral vectors into the spinal fluid can also be used as a mode of administration, especially 
in the case of neurodegenerative diseases. Following injection, the viral vectors will 
circulate until they recognize host cells with the appropriate target specificity for 
infection. 

30 

An alternate mode of administration of HPBF antisense-encoding vectors 
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can be by direct inoculation locally at th sit of the disease or pathological condition or 
by inoculation into the vascular system supplying the tumor with nutrients. Local 
administration is advantageous because there is no dilution effect and, therefore, a 
smaller dose is required to achieve HPBF expression in a majority of the targeted cells. 
5 Additionally, local inoculation can alleviate the targeting requirement required with 
other forms of administration since a vector can be used that infects all cells in the 
inoculated area. If expression is desired in only a specific subset of cells within the 
inoculated area, then promoter and regulatory elements that are specific for the desired 
subset can be used to accomplish this goal. Such non-targeting vectors can be, for 
10 example, viral vectors, viral genome, plasmids, phagemids and the like. Transfection 
vehicles such as liposomes can also be used to introduce the non-viral vectors described 
above into recipient cells within the inoculated area. Such transfection vehicles are 
known by one skilled within the art. 

15 In addition to the antisense methods described above, other methods can 

be used as well to decrease the activity of HPBF and achieve the down regulation of 
ERBB2 activity. For example, oligonucleotides which compete for the HPBF binding 
she within the ERBb2 regulatory elements can be used to competitively inhibit HPBF 
binding to ERBB2. Such oligonucleotides can be, for example, methylphosphonates and 

20 thiophosphonates which permeate the cell membrane. Alternatively, vectors which 
express such sequences or contain the HPBF binding element can also be used to 
achieve the same result as the oligonucleotides. Modes of administration for the 
competitive inhibition are similar to that described above for the antisense vectors and 
oligonucleotides. 

25 

The present invention also provides for a bioassay for screening 
substances for the ability to inhibit the production of HPBF comprising administering the 
substance to a cell having a gene activity expressing the HPBF gene (an activated gene 
encoding HPBF) and then determining the amount of HPBF subsequently produced. 

30 

Stabely transformed cell lines expressing HPBF can be constructed in 
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several ways. One example of such a technique is integrating genetic material known to 
encode HPBF into the chromosome of a host cell. Such integration, usually mediated 
through transection of the DNA by DEAE Dextran, Calcium Phosphate precipitation, 
or via liposome encapsulation, can be coupled to the introduction of genes utilized to 
5 enhance gene expression. For example, the metabolic inhibitor, dihydrofolate reductase 
can be selected as the cotransfecting DNA to achieve DNA amplification and therefore 
enhanced or activated gene expression. In such a system, co-transfected cells are 
treated with methotrexate, a known inhibitor of dihydrofolate reductase. Cells resistant 
to methotrexate obtain this resistance by amplifying the numbers of dihydrofolate 
10 reductase genes. Genes other than the dihydrofolate gene are amplified as well m . 

Amplification of the cotransfected gene can be verified in several ways. 
These techniques can be, but are not limited to quantitative polymerase chain reaction, 
Southern blot hybridization, and dot blot hybridization. The presence of enhanced levels 
15 of HPBF protein can also be detected. One example of such a technique is through 
separating cellular proteins by pofyacrylamide gel electrophoresis, either single or two 
dimensional, and then visualized by staining, or through antigen-antibody interaction. 
Such techniques are very well known in the art (Sambrook et al, Molecular Cloning, A 
laboratory Manual, Cold Springs Harbor, New York, 1989). 

20 

Cells expressing HPBF can then be contacted with substances to screen 
for those which decrease the amount of HPBF produced. Techniques for detecting a 
change in the amount of HPBF produced can be, but are not limited to polyacrylamide 
gel electrophoresis, enzyme linked immunosorbent assay and by bioassay. 

25 

The invention will now be demonstrated by the following non-restrictive 

examples: 

The present invention is more particularly described in the following 
30 examples which are intended as illustrative only since numerous modifications and 
variations therein will be apparent to those skilled in the art 
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EXAMPLES 

GENERAL METHODS 

Preparation of Cytoplasmic and Nuclear Extracts 

The cytoplasmic and nuclear extracts from tissues and cells were 
5 prepared following standard procedures. 92 Briefly, cells were trypsinized (lxl 0 9 ) and 
centrifuged at 5,500 rpm for 10 minutes. The supernatant was discarded and the pellet 
washed twice in 5x volume of phosphate buffered saline (PBS). Centrifugation step was 
repeated. The cell pellet was resuspended in 5x pellet volume of ice-cold buffer A 
(15mM KC1, lOmM Hepes, 2mM MgCl 2> 0. ImM EDTA). All remaining steps were 

10 performed at 4°C. The cells and tissues were homogenized using a glass-glass dounce 
homogenizes The homogenization was complete when >85% of the cells were lysed as 
determined by phase contrast microscopy. The homogenate was mixed with 1/10 vol of 
buffer B (1M KC1, 50mM Hepes, 30mM MgCl* 0. ImM EDTA, ImM DTT) and left on 
ice for 4-5 minutes followed by centrifugation at 10,000 rpm for 10 minutes. The 

1 5 supernatant was reserved for cytoplasmic extraction. The nuclear pellet was 

resuspended in 5 ml in a buffer of 9 parts buffer A and 1 part buffer B. Ammonium 
sulphate (4M, pH 7.9) was added to the extract to a final concentration of 0.36M and 
the nuclear proteins were extracted by gentle rocking on a shaker at 4°C for 30 minutes. 
The DNA was separated from the proteins by centrifugation of the lysate at 150,000g 

20 for 60 minutes. The supernatant was coDected and the proteins were precipitated by the 
addition of 0.25 g ammonium sulphate per ml of supernatant The precipitated proteins 
were collected by centrifugation at 150,000g for 15 minutes and suspended in one-half 
of the original cell pellet volume in buffo- C (10% Glycerol, 25mM Hepes (pH 7.6), 
40mM KC1, 0. ImM EDTA, ImM DTT). The proteins were dialyzed against Buffer C 

25 for 2-4 hours, collected in a tube and centrifuged at 10,000 rpm for 10 minutes. Protein 
concentration was determined by Bio-Rad® protein reagents and the extract was stored 
in smaller aliquots at -70° C. 

For cytoplasmic extraction of the reserved supernatant, 5 g of ammonium 
30 sulfate was added per 10 ml of supernatant and dissolved by gentle shaking at 4°C, The 
supernatant was then centrifuged the same way as for nuclear extract preparation. The 
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precipitate was suspended in Buffer C and dialyzed against Buffer C as for nuclear 
extract preparation. 

Preparation of Double Stranded Oligonucleotides 
5 An aliquot of equal moles of sense and anti-sense oligonucleotides in 

H 2 0 was mixed and the mixture was incubated sequentially at 95- 100°C for 10 
minutes, at 65 °C for 1 hour, 37°C for 2-3 hours and at RT for 5 hours to form the 
double stranded (ds) oligonucleotides. The DNA was precipitated by the addition of 
0.3M NaOAC and 2.5 vol of 100% ETOH. The precipitated DNA was collected by 
* 1 0 centrifugation and washed once with 70% ETOH and the pellet was dried under 

vacuum. The DNA was suspended in Hfi and the exact concentration is determined by 
spectrophotometry. 

5' End Labelling of Double Stranded Oligonucleotides 

15 The 5' end labelling was accomplished essentially according to the 

manufacturer's protocol (Stratagene) using a-^P-AT? and the probe was purified 
through gel extraction. The labeled oligonucleotide was separated through an 8- 10% 
PAGE in lx TBE (Tris-borate-EDTA buffer). Loading of the samples was done by 
mixing with 5x dye. 93 Electrophoresis was continued at 30-36 mA for about 2-4 hours 

20 and the gel was exposed to Kodak® XAR-5 film and developed after about 10 minutes 
of exposure. The ds oligonucleotide band was cut from the gel, cut into smaller pieces 
and mixed with two volumes of a mixture containing 0.5M NH4OAC and ImM EDTA 
and allowed to shake at 37°C overnight The whole suspension was passed through 
glass wool in a 3 ml syringe and the clear radioactively labeled DNA solution was 

25 collected. Yeast tRNA, to a final concentration of 30-40 /ig/ml, was added to the 
. labelled DNA and precipitated with 2.5 volume of ETOH overnight at -20°C. The tube 
was then centrifuged, the pellet washed once with 70% ETOH, and vacuum dried. The 
vacuum dried pellet was suspended in TE and the radioactivity was determined by 
counting an aliquot 

30 
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Gel MpWity Shift Asgay (GMSA) 

The tissue or cell extract was mixed with 5x binding buffer (125 mM 
HEPES, pH 7, containing 50 mM KC1, 5 mM DTT, 5 mM EDTA, 50% Glycerol and 
0.25% NP-40), poly dI:dC (1-2 ng) and H 2 0, and the mixture incubated at RT for 10 
5 minutes in a reaction volume of 20-25 §A. The labelled probe (12,000-15,000 cpm) 
was then added to the mixture and the reaction was continued at RT for 40 minutes. At 
the end of the reaction time, 1 //I of 5x dye was added and loaded on a 6% pre-run 
PAGE in Ix TBE. The electrophoresis was continued at 32-36 mAmp. The gel was 
dried and exposed to the X-ray film. 

10 

Southwestern (DNA-Protein) Blot Assay 

For the Southwestern procedure, the cytoplasmic or nuclear proteins 
were separated on SDS-PAGE (10% separating gel) 93 under reducing conditions and 
the proteins were electrotransferred onto nylon membrane (Immobilon® P membrane). 

15 < The membrane was washed three times (one hour each) with renaturation buffer (lOmM 
Trisflcl, pH 7.5, 150mM NaCl, lOmM DTT, 2.5% NP-40, 10% Glycerol and 5% 
nonfat dry milk) and rinsed briefly in binding buffer (lOmM Tris-Hd, pH 7.5, 40mM 
NaCl, ImM DTT, ImMEDTA, 8% Glycerol and 0.125% non-fet diy milk). The 
membrane was then incubated in 15 ml of binding buffer plus 45 fig poly (dl-dC), 5xhM 

20 MgOj and 1 x 10 6 cpm of 32 P-labelled DNA probe per ml for 15 hours at RT with 
continuous agitation. The membrane was washed four times (30 minutes each) in 
lOmM Tris-Hcl, pH 7.5 containing 50mMNaCI and exposed to X-ray film. 

Preparation of Sequence-Specific DNA-Senharose Resin 
25 Chemically synthesized complementary oligonucleotides corresponding 

to -22 to +9 sequences (see Examples) of ERBB2 were annealed, 5-phosphoryiated, 
ligated and coupled to CNBr-activated sepharose 4B essentially according to the 
method of Kadonaga and Tjian 94 



30 



Affinity Purification of Sequence-Specific DNA-binding Protdn 

All operations were performed at 4°C. The oligonucleotide-aflBnity resin 



WO 95728485 



PCT/US95/04953 



31 

(1 ml) was equilibrated with buffer Z (0. 1 M KC1, 25 mM HEPES pH 7.6, 12.5 mM 
MgCl 2 , 15% glycerol, 1 mM DTT and 0.05% NP-40). Cytoplasmic and/or nuclear 
extracts (10 ml) were dialyzed against buffer Z, combined with 250 Mg of salmon sperm 
DNA and allowed to stand for 10 minutes on ice. This protein-DNA mixture was then 
5 mixed with the ERBB2-sepharose resin for 5-8 hours at 4°C with occasional shaking 
and then loaded onto a column The mixture was allowed to elute under gravity flow 
and washed with 4 to 5 column bed volumes of buffer Z. At this stage, the column was 
stopped, buffer Z containing 1M KC1 (10 ml) was added and mixed with the resin 
thoroughly. The resin was allowed to stand for 15 minutes with occasional mixing and 
10 then the protein was ehited. This first cycle higher salt eluate was diluted in 0. 1 M KC1 
buffer Z, mixed with salmon sperm DNA and the whole procedure was repeated for 
second cycle purification identical to the first cycle. 

Cell Lines and Primary Tumor Tissue 
1 5 CeU lines NIH-3T3, (ATCC Accession No. CRL 1 658) and SKBR3 

(ATCC Accession No. HTB 30) were used. Primary breast cancer samples were 
obtained from mastectomy specimens. Pathology of each sample was confirmed using 
H&E stained frozen as well as formalin fixed tissue sections. 

20 EXAMPLE 1 

Preparation of Probes 
In order to identify specific factors) that are responsible for the 
regulation of the ERBB2 gene, three sets of sense and anti-sense ds- 
oligonucleotides based on the DNA sequence of a genomic clone of the ERBB2 
25 promoter region entered in the Genbank were prepared. The promoter DNA sequence 
. was analyzed through a Genbank data search. 21 The Genbank Accession numbers were 
M16789 95 and M16892 96 . The DNA sequences of these three sets of oligonucleotides 
are indicated below and a map is shown in Figure 1 . 

30 The first sets were from base -79 to +9, relative to the last transcription 

start she (+1). The last transcription start site is 1 cated at position -178 relative to the 
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first translational start codon W ATG". Therefore, the first set f oligonucleotides are 
from -258 to -169 relative to the first translational start codon "ATG H . Position -178 is 
located at 21 bp downstream from the last TATAA box (-204 to -200 relative to the 
translational start codon). This set (Set 1, Probe C) of oligonucleotides consists of 
5 DNA sequences from the transcriptional start site, including TATAA and CAAT boxes. 
The second set (Set 2, Probe A) was from the same region, excluding TATAA and 
CAAT boxes (-79 to -22 relative to the transcriptional start she). The third set (Set 3, 
Probe B) of oligonucleotides was also from the same region excluding TATAA and 
CAAT boxes, but including transcriptional start site (-22 to +9), and including 
10 immediate base sequences upstream from the transcriptional start she, phis a few bases 
downstream of the transcriptional start site. 

Set No. 1 to create probe C: 
Sense Sequence: contains a three nucleotide 5' overhang. 
15 5' — GCT-CCC AATC AC AGGAGAAGGAGGAGGTGGAGGA 
GGAGGGCTGCTTGAGGAAGTATAAGAATGAAGTTGTG 
AAGCTGAGATTCCCCTC C — 3 '(SEQ ID NO:5) 

Antisense Sequence: contains a three nucleotide 5' overhang. 

20 3' GGGTTAGTGTCCTCTTCCTCCTCCACCTCCTCC 

TCCCGACGAACTCCTTCATATTCTTACTTCAACACTTC 
GACTCTAAGGGGAGG-CA T — 5' (SEQ ID NO:6) 

Set No. 2 to create probe A: 
25 Sense Sequence: contains a three nucleotide 5' overhang 

5 1 — GCT-CCC AATC AC AGGAGAAGGAGGAGGTGGAGGA 

GGAGGGCTGCTTG 

AGGAAGTATAAG A — 3* (SEQ ID NO:7) 

30 Antisense Sequence: contains a three nucleotide 5' overhang. 

3' GGGTTAGTGTCCTCTTCCTCCTCCACCTCCTCC 
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TCCCGACGAACTCCTTCATATTCT-CA T — 5' (SEQIDNO:8) 

Set No. 3 to create probe B : 
Sense Sequence: contains a three nucleotide 5* overhang, 

5 5' — TAC-GAATGAAGTTGTGAAGCTGAGATTCCCCTC 
C— 3* (SEQ ID NO:3) 

Antisense Sequence: contains a three nucleotide 5* overhang. 

3' CTTACTTCAACACTTCGACTCTAAGGGGAGG- 

10 C A T— 5' (SEQ ID NO:4) 

The sequence and location of probe B is indicated in Figure 1. The 
position for SP1 binding sites and the classical CAAT and TATAA box is also indicated. 
All three sets of these oligonucleotide were used to generate double stranded DNA 
1 5 (ds-oligonucleotide). 

EXAMPLE 2 

Analysis bv GMSA 
Radioisotopically ( 32 P) labelled ds-oligonucleotide probes were made and 
20 Gel Mobility Shift Assays (GMSA) were carried out. For initial experiments, nuclear 
and cytoplasmic extracts were made from a benign specimen (normal) and a paired 
specimen of benign and tumor (adenocarcinoma admixed with carcinoma in situ), freshly 
collected from breast mastectomies, as well as SKRB3 cell extracts. 

25 Nuclear and cytoplasmic extracts from a benign specimen and from a 

paired specimen ofbenign and tumor (pathologically diagnosed as adenocarcinoma) 
from the breast were analyzed by GMSA using all three probes. Probe B identified a 
specific factor which is present only in the nuclear and cytoplasmic extract of the tumor 
sample. The presence of this factor was totally absent in the nuclear extracts ofbenign 

30 tissue. However, the cytoplasmic extracts of both of the benign tissue samples show the 
presence of this factor at an extremely low level. 
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EXAMPLE 3 

Further GMSA Analysis with Probe B 
A series of four breast specimens of paired benign (B) and tumor (T) was 
analyzed similarly using GMSA and utilizing Probe B. The benign and tumor tissues 
5 were taken from the same quadrant area of the excised tissue. The histopathology 
examination identified the apparently benign area for use in the assay. Nuclear and 
cytoplasmic extracts from an atypical hyperplastic breast specimen were included. 

These results clearly show the presence of a probe-B-specific binding 
10 factor in the tumor extracts of both nuclei and cytoplasm. The nuclear extracts of the 
apparently benign tissue from the same quadrant was completely devoid of this factor in 
this assay system. However, the cytoplasmic extracts of apparently benign and atypical 
hyperplastic tissue show the presence of this binding factor at a low level. It is not clear 
if the histopathologically apparently benign tissue from the same quadrant as the tumor 
15 is truly benign or whether it is in an early pre-cancerous stage which this assay 

recognizes. Similarly, HPBF has also been detected from cytoplasmic/nuclear extracts 
of a breast cancer cell line (SKBR3) known to overexpress ERBB2. 

EXAMPLE 4 

20 Binding Specificity pf Pastor 

The binding specificity of the factor was confirmed with a sample which 
showed highest binding with probe B. Nuclear extracts ofbenign tissue were negative, 
whereas nuclear and cytoplasmic extracts of tumor specimens were positive for the 
Probe-B-binding factor. Binding of this factor with Probe B was completely abolished 

25 by excess unlabelled Probe B. This binding was not abolished using 50 fold unlabelled 
NFAB or SP1 probe, indicating that the binding of this factor is Probe-B-specific. 

EXAMPLES 

Determination of Factor as Protein 
30 It was next determined that the binding factor (HPBF) is a protein. 

For this, the nuclear and cytoplasmic extracts were fractionated through 
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SDS-polyacrylamide gel electrophoresis (SDS-PAGE). The proteins were transferred 
to nylon membrane and reacted with ^-labelled probe B (Southwestern assay). Both 
the membranes show binding activity with probe B and probe A. 



5 A protein of about SO kDa can bind to probe B only with tumor cell 

extracts (nuclear and cytoplasmic). The nuclear and cytoplasmic extracts of benign 
tissue failed to show any signal in the Southwestern assay, indicating that the level of 
tins DNA-binding protein is extremely low in apparently benign breast tissue. 

10 EXAMPLE 6 

Isolation and Purification of HPBF 
In order to isolate and purify the probe-B-specific DNA-binding 
protein (HPBF), a strategy for the purification of DNA-binding protein was used. This 
strategy is diagramed in Figure 2, using ds-oligonucleotide probe B to generate an 
15 affinity resin. 

Pooled cytoplasmic extract from three breast tumor specimens were 
subjected to the affinity purification. The extracts were passed through the affinity 
column and washed. The bound proteins were eluted with high salt buffer and three one 

20 milliliter fractions were collected. The proteins in the high salt eluate were fractionated 
through SDS-PAGE and silver-stained. The high salt wash in three fractions showed a 
specific protein at a very high concentration at around 44,000-47,000 dahon molecular 
weight. This again demonstrates the presence of a major protein, HPBF, of about 50 
kDa as has been previously shown in the Southwestern assay. HPBF was dialyzed 

25 against GMSA binding buffer and stored in aliquots at -70°C. 

EXAMPLE 7 

Binding Specificity of Purified HPBF 
The binding specificity of the purified HPBF was tested using GMSA 
30 and labelled probe B. 
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Only the tumor extract and purified HPBF bound DNA and formed a 
complex with probe B. The probe-B-specific binding protein is present in the tumor 
tissue specimen and the affinity purified protein. The benign extract did not show any 
binding. The specificity of the binding was competed out by unlabelled probe B, 
5 whereas a non-specific probe was unable to compete for the binding activity. 

These results clearly document the identification of a protein factor (a 
DNA-binding protein), HPBF, which specifically binds to the promoter region of the 
ERBB2 gene sequences. 

10 

EXAMPLE 8 

Amino Acid Sequence of Peptide of HPBF 
An asp-N digest of the purified protein was performed following 
routine procedures well known to those skilled in the art. An N-terminal ten amino add 

1 5 sequence of a peptide generated by the asp-N digest was determined using an automated 
protein micro sequencer. The ten amino acid sequence was determined to be Aspartic 
add- Glycine- Aspartic add- Asparagme- Phenylalanine- Proline- Leucine- Alanine- 
Proline- Phenylalanine (DGDNFPLAPF) (SEQ ID NO: 1). It should be noted that 
the amino add sequence of the protein may be slightly different due to possible 

20 sequencing errors. Such errors can be determined by repeating the methods to confirm 
sequence accuracy. The sequence was compared with known amino add sequences in 
Genbank and no matches were found, indicating the novel nature of this peptide. 

Further, a cyanogen bromide deavage of the purified protein was 
25 performed following routine procedures well known to those skilled in the art An 
N-terminal ten amino add sequence of a peptide generated by the cyanogen bromide 
cleavage was determined using an automated protein micro sequencer. The ten amino 
add sequence was determined to be Lysine- Isoleudne- Alanine* Isoleucine- Glutamic 
acid- Alanine- Glycine- Tyrosine- Aspartic; add- Phenylalanine (KIAIEAGYDF) 
30 (SEQ ID N0.2). The sequence was compared with known amino add sequences in 
Genbank and no match was found, indicating the novel nature of this peptide. 
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Therefore, these results indicate that HPBF (ERBB2 gene specific 
DNA-binding protein) is a newly discovered protein with known biological function, 
that has never been documented. 

EXAMPLE 9 

HPBF Induces Cell Proliferation 
Purified and isolated HPBF was micro-injected into serum-starved 
NIH-3T3 cells as has been described in the scientific literature. 97 



10 Microinjection of HPBF into the quiescent NIH 3T3 cells induced the 

onset of DNA synthesis as detailed in TABLE 1 herein. DNA synthesis increased 12- 13 
fold with HPBF. The DNA synthesis was increased 28 fold in the presence of the Ras 
oncogene and HPBF, suggesting that the factor either has a autogenic activity or is a 
component of mitogenic signalling pathways. The Ras oncogene was microinjected at 

IS an amount that gives minimal stimulation, as shown in Table I, since mnytmnl 

stimulation as reported by Smith et a/. 97 would not allow the HPBF response to be 
measured. Bovine serum albumin (BSA) was used as a control and showed, at most, a 
two-fold induction compared to the twelve to thirteen-fold increase induced by two 
separate extracts of HPBF. This induction of cell proliferation can be competed out 

20 slightly by incubating with probe B (ds-oligonucleotide 3), but not with nonspecific 
probe A (ds-oligonucleotide 2). 
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TABLE I 



10 



15 



20 



Sample 



BSA 

HPBF extract 1 

HPBF extract 2 

HPBF-1 + Probe A 
HPBF-1 + Probe B 
c-Ras 

HPBF-1 + c-Ras 



% Injected 

Cells 
in S-Phase 



3 
38 

32 

25 
16 
19 
72 



Fold 
Induction 



2 (1) 

13 (4) 

12 (3) 

9 (3) 

4 (2) 

5 (2) 
28 (7) 



25 



EXAMPLE 10 



HPBF Can Be Measured in Sera 



An ELISA assay of sera from breast, pancreas and kidney cancer patients 
against an anti-HPBF polyclonal antiserum demonstrated the presence of HPBF in the 
sera of breast cancer patients. 



30 

The polyclonal anti-HPBF sera were developed in hyperimmunized mice 
and were a pool of sera from three mice. The mice were being injected with purified 
and isolated HPBF for the production of monoclonal antibodies and the sera were 
obtained to determine the response of the immunized mice to the purified protein. 

35 

EXAMPLE 11 

Production of Polyclonal and Monoclonal Antibodies 



40 



Polyclonal antibodies against the human breast tumor-derived protein 
(HPBF) found in both nucleus and cytoplasm, were prepared by immunization of a 
NZW rabbit The material used for immunization was purified from a crude nuclear 
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extract by oligonucleotide affinity chromatography. The animal was injected with the 
purified protein emulsified with Freund's Complete Aduvant for the initial injection, then 
emulsified with Freund*s Incomplete Aduvant for a second injection, and finally boosted 
with an injection of protein antigen in aqueous phase only. The animal was bled at 
5 weekly intervals and the serum analyzed for antibody activity using ELISA methodology 
with the purified antigen coated on the plate. The antiserum at peak development could 
be diluted >1 : 10,000 and still retain activity. Also, the antiserum was also used in a 
Western blot format to identify the antigen on a polyacrylamide gel at the correct 
molecular weight. This antibody retained activity after purification of the 
1 0 immunoglobulin by protein A-sepharose chromatography. 

Monoclonal antibodies specifically reactive with HPBF protein were also 
prepared by immunizing a Balb/cAnnCr mouse with the affinity-purified protein after a 
further purification by cutting the specific band from a polyacrylamide gel. A similar 

15 immunization protocol was used, as described for polyclonal antibody production. After 
the mouse antiserum was shown to have antibody activity by ELISA testing, the animal 
was sacrificed and the spleen harvested. A spleen cell suspension was used to do a 
standard polyethylene glycol 1500 mediated-cell fusion with mouse myeloma 8.653 cells 
to form hybrids. Culture supernatants from the resulting cell hybridomas were screened 

20 for antibody activity using the same ELISA method. Antibody positive wells were 

cloned in two stages by limiting dilution to derive the present twenty-one clones that are 
being evaluated. AD have antibody activity in the ELISA, and some are Western blot 
positive as well. Purified antibody has been made from some of these clones, and some 
of these, as well as the polyclonal antibody react with breast cancer cells in 

25 immunohistochemical studies. 

The invention has been described in an illustrative manner, and it is to be 
understood that the terminology which has been used is intended to be in the nature of 
30 words of description rather than of limitation. 
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Throughout this application, various publications are referenced. The 
disclosures of these publications in their entireties are hereby incorporated by reference 
into this application in order to more fully describe the state of the art to which this 
invention 
S pertains. 

Although the present process has been described with reference to 
specific details of certain embodiments thereof it is not intended that such details should 
be regarded as limitations upon the scope of the invention except as and to the extent 
1 0 that they are included in the accompanying claims. 

Throughout this application various publications are referenced by full 
citation or numbers. Full citations for the publications referenced by number are listed 
below. The disclosures of these publications in their entireties are hereby incorporated 
15 by reference into this application in order to more fully describe the state of the art to 
which this invention pertains. 
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SEQUENCE LISTING 



<1) GENERAL INFORMATION : 

(i) APPLICANT: Raziuddin 

sarkar, Fazlul H 

(ii) TITLE OF INVENTION: ERBB2 PROMOTER BINDING PROTEIN IN* 

NEOPLASTIC DISEASE 

(ill) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NEEDLE & ROSENBERG, P.C. 

(B) STREET: Suite 1200, 127 Peachtree Street, NE 

(C) CITY: Atlanta 

(D) STATE: Georgia 

(E) COUNTRY: USA 

(F) ZIP: 30303-1811 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: David G. Perryman 

(B) REGISTRATION NUMBER: 33,438 

(C) REFERENCE/DOCKET NUMBER: 1414.608 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (404) 688-0770 

(B) TELEFAX: (404) 688-9880 



(2) INFORMATION FOR SEQ ID NO:l? 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Asp Gly Asp Asn Phe Pro Leu Ala Pro Phe 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Lys 
1 



lie Ala lie Glu Ala Gly Tyr Asp Phe 

... 5 10 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TACGAATGAA GTTGTGAAGC TGAGATTCCC CTCC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : " linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CTTACTTCAA CACTTCGACT CTAAGGGGAG GCAT 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GCTCCCAATC ACAGGAGAAG GAGGAGGTGG AGGAGGAGGG CTGCTTGAGG AAGTATAAGA 60 
ATGAAGTTGT GAAGCTGAGA TTCCCCTCC 89 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
. (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGGTTAGTGT CCTCTTCCTC CTCCACCTCC TCCTCCCGAC GAACTCCTTC ATATTCTTAC 60 

TTCAACACTT CGACTCTAAG GGGAGGCAT 89 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE:, nucleic acid 
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(C) STRANDEDNESS ; single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCTCCCAATC ACAGGAGAAG GAGGAGGTGG AGGAGGAGGG CTGCTTGAGG AAGTATAAGA 60 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGTTAGTGT CCTCTTCCTC CTCCACCTCC TCCTCCCGAC GAACTCCTTC ATATTCTCAT 60 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



AATTCTCGAG 


CTCGTCGACC 


GGTCGACGAG 


CTCGAGGGTC GACGAGCTCG 


AGGGCGCGCG 


60 


CCCGGCCCCC 


ACCCCTCGCA 


GCACCCCGCG 


CCCCGCGCCC TCCCAGCCGG 


GTCCAGCCGG 


120 


AGCCATGGGG 


CCGGAGCCGC 


AGTGAGCACC 


ATGGAGCTGG CGGCCTTGTG 


CCGCTGGGGG 


180 


CTCCTCCTCG 


CCCTCTTGCC 


CCCCGGAGCC 


GCGAGCACCC AAGTGTGCAC 


CGGCACAGAC 


240 


ATGAAGCTGC 


GGCTCCCTGC 


CAGTCCCGAG 


ACCCACCTGG ACATGCTCCG 


CCACCTCTAC 


300 


CAGGGCTGCC 


AGGTGGTGCA 


GGGAAACCTG 


GAACTCACCT ACCTGCCCAC 


CAATGCCAGC 


360 


CTGTCCTTCC 


TGCAGGATAT 


CCAGGAGGTG 


CAGGGCTACG TGCTCATCGC 


TCACAACCAA 


420 


GTGAGGCAGG 


TCCCACTGCA 


GAGGCTGCGG 


ATTGTGCGAG GCACCCAGCT 


CTTTGAGGAC 


480 


AACTATGCCC 


TGGCCGTGCT 


AGACAATGGA 


GACCCGCTGA ACAATACCAC 


CCCTGTCACA 


540 


GGGGCCTCCC 


CAGGAGGCCT 


GCGGGAGCTG 


CAGCTTCGAA GCCTCACAGA 


GATCTTGAAA 


600 


GGAGGGGTCT 


TGATCCAGCG 


GAACCCCCAG 


CTCTGCTACC AGGACACGAT 


TTTGTGGAAG 


660 


GACATCTTCC 


ACAAGAACAA 


CCAGCTGGCT 


CTCACACTGA TAGACACCAA 


CCGCTCTCGG 


720 


GCCTGCCACC 


CCTGTTCTCC 


GATGTGTAAG 


GGCTCCCGCT GCTGGGGAGA 


GAGTTCTGAG 


780 


GATTGTCAGA 


GCCTGACGCG 


CACTGTCTGT 


GCCGGTGGCT GTGCCCGCTG 


CAAGGGGCCA 


840 


CTGCCCACTG 


ACTGCTGCCA 


TGAGCAGTGT 


GCTGCCGGCT GCACGGGCCC 


CAAGCACTCT 


900 


GACTGCCTGG 


CCTGCCTCCA 


CTTCAACCAC 


AGTGGCATCT GTGAGCTGCA 


CTGCCCAGCC 


960 
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CTGGTCACCT 


ACAACACAGA 


CACGTTTGAG 


TCCATGCCCA 


ATCCCGAGGG 


CCGGTATACA 


1020 


X X UbUvUvLA 




TGCCTGTCCC 


TACAACTACC 


TTTCTACGGA 


CGTGGGATCC 


1080 


X V7V*m^^^*X 




GCACAACCAA 


GAGGTGACAG 


CAGAGGATGG 


AACACAGCGG 


1140 


T GT GAGAAGT 


GCAGCAAGCC 


CTGTGCCCGA 


GTGTGCTATG 




GGAGCACTTG 


1200 


CGAGAGGTGA 


GGGCAGTTAC 


CAGTGCCAAT ATCCAGGAGT 


TTGCTGGCTG 


caagaagatc 


1260 


TTT GGGAGCC 


TGGCATTTCT 


GCCGGAGAGC 


TTTGATGGGG 


AwCuAGCCTC 


CAACACTGCC 


1320 


CCGCTCClVRf 


CAGAGCAGCT 


CCAAGTGTTT 


GAGACTCTGG 


AAGAGAT CAC 


AGGTTACCTA 


1380 




CaTGGCCGGA 


CAGCCTGCCT 


GACCTCAGCG 


wwi^+mww%^+r+ Ft %k 

TCTTCCAGAA 


CCTGCAAGTA 


1440 




GAaTTCTGCA 


CAATGGCGCC 


TACTCGCTGA 


CCCTGCAAGG 


GCTGGGCATC 


1500 


J\vr\* X X 


GGCT6CGCTC ACTGAGGGAA 


CTGGGGAGTG 


GACTGGCCCT 


CATCCACCAT 


1560 


AACACCCACC 


TCTGCTTCGT 


GCACACGGTG 


CCCTGGGACC 


AGCTCTTTCG 


GAACCCGCAC 


1620 


CAAGCTCTGC 


TCCACACTGC 


CAACCGGCCA 


GAGGACGAGT 


GTGTGGGCGA 


GGGCCTGGCC 


1680 


TGCCACCAGC 


TGTGCGCCCG AGGGCACTGC 


TGGGGTCCAG 


GGCCCACCCA 


GTGTGTCAAC 


1740 


TGCAGCCAGT 


TCCTTCGGGG 


CCAGGAGTGC 


GTGGAGGAAT 


GCCGAGTACT 


GCAGGGGCTC 


1800 




ATGTGAATGC 


CAGGCACTGT 


TTGCCGTGCC 


ACCCTGAGTG 


TCAGCCCCAG 


1860 




TGACCTGTTT 


TGGACCGGAG 


GCTGACCAGT 


GTGTGGCCTG TGCCCACTAT 


1920 




CCTTCTGCGT 


GGCCCGCTGC 


CCCAGCGGTG 


TGAAACCTGA 


CCTCTCCTAC 


1980 


/vX V7l«l^W\X \+ X 


GGAAGTTTCC AGATGAGGAG 


GGCGCATGCC 


AGCCTTGCCC 


CATCAACTGC 


2040 


nULWiU X X 


GTGTGGACCT 


GGATGACAAG 


GGCTGCCCCG 


CCGAGCAGAG AGCCAGCCCT 


2100 


\* X UtHwi L*wi. 


TCGTCTCTGC 


GGTGGTTGGC ATTCTGCTGG 


TCGTGGTCTT 


GGGGGTGGTC 


2160 


X X lUVTWll Irtr 


TCATCAAGCG ACGGCAGCAG AAGATCCGGA 


AGTACACGAT 


GCGGAGACTG 


2220 




CGGAGCTGGT 


GGAGCCGCTG ACACCTAGCG 


GAGCGATGCC 


CAACCAGGCG 


2280 




TCCTGAAAGA 


GACGGAGCTG AGGAAGGTGA 


AGGTGCTTGG ATCTGGCGCT 


2340 


X X X w\3W\L/\V7 


TCTACAAGGG 


CATCTGGATC 


CCTGATGGGG 


AGAATGTGAA AATTCCAGTG 


2400 


wWWtX Wwvivj 


TGTTGAGGGA AAACACATCC 


CCCAAAGCCA 


ACAAAGAAAT CTTAGACGAA 


2460 


GCATACGTGA 


TGGCTGGTGT 


GGGCTCCCCA TATGTCTCCC 


GCCTTCTGGG 


CATCTGCCTG 


2526 


ACATCCACGG 


TGCAGCTGGT GACACAGCTT ATGCCCTATG 


GCTGCCTCTT AGACCATGTC 


2580 


CGGGAAAACC 


GCGGACGCCT 


GGGCTCCCAG 


GACCTGCTGA 


ACTGGTGTAT 


GCAGATTGCC 


2640 


AAGGGGATGA 


GCTACCTGGA GGATGTGCGG CTCGTACACA 


GGGACTTGGC 


CGCTCGGAAC 


2700 


GTGCTGGTCA 


AGAGTCCCAA 


CCATGTCAAA attacagact 


TCGGGCTGGC TCGGCTGCTG 


2760 


GACATTGACG 


AGACAGAGTA 


CCATGCAGAT 


GGGGGCAAGG 


TGCCCATCAA 


GTGGATGGCG 


2820 


CTGGAGTCCA 


TTCTCCGCCG 


GCGGTTCACC 


CACCAGAGTG 


ATGTGTGGAG 


TTATGGTGTG 


2880 
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ACTGTGTGGG AGCTGATGAC TTTTGGGGCC AAACCTTACG ATGGGATCCC AGCCCGGGAG 2940 
ATCCCTGACC TGCTGGAAAA GGGGGAGCGG CTGCCCCAGC CCCCCATCTG CACCATTGAT 3000 
GTCTACATGA TCATGGTCAA ATGTTGGATG ATTGACTCTG AATGTCGGCC AAGATTCCGG 3060 
GAGTTGGTGT CTGAATTCTC CCGCATGGCC AGGGACCCCC AGCGCTTTGT GGTCATCCAG 3120 
AATGAGGACT TGGGCCCAGC CAGTCCCTTG GACAGCACCT TCTACCGCTC ACTGCTGGAG 3180 
GAC GATGACA TGGGGGACCT GGTGGATGCT GAGGAGTATC TGGTACCCCA GCAGGGCTTC 3240 
TTCTGTCCAG ACCCTGCCCC GGGCGCTGGG GGCATGGTCC ACCACAGGCA CCGCAGCTCA 3300 
TCTACCAGGA GTGGCGGTGG GGACCTGACA CTAGGGCTGG AGCCCTCTGA AGAGGAGGCC 3360 
CCCAGGTCTC CACTGGCACC CTCCGAAGGG GCTGGCTCCG ATGTATTTGA TGGTGACCTG 3420 
GGAATGGGGG CAGCCAAGGG GCTGCAAAGC CTCCCCACAC ATGACCCCAG CCCTCTACAG 3480 
CGGTACAGTG AGGACCCCAC AGTACCCCTG CCCTCTGAGA CTGATGGCTA CGTTGCCCCC 3540 
CTGACCTGCA GCCCCCAGCC TGAATATGTG AACCAGCCAG ATGTTCGGCC CCAGCCCCCT 3600 
TCGCCCCGAG AGGGCCCTCT GCCTGCTGCC CGACCTGCTG GTGCCACTCT GGAAAGGGCC 3660 
AAGACTCTCT CCCCAGGGAA GAATGGGGTC GTCAAAGACG TTTTTGCCTT TGGGGGTGCC 3720 
GTGGAGAACC CCGAGTACTT GACACCCCAG GGAGGAGCTG CCCCTCAGCC CCACCCTCCT 3780 
CCTGCCTTCA GCCCAGCCTT CGACAACCTC TATTACTGGG ACCAGGACCC ACCAGAGCGG 3840 
GGGGCTCCAC CCAGCACCTT CAAAGGGACA CCTACGGCAG AGAACCCAGA GTACCTGGGT 3900 
CTGGACGTGC CAGTGTGAAC CAGAAGGCCA AGTCCGCAGA AGCCCTGATG TGTCCTCAGG 3960 
GAGCAGGGAA GGCCTGACTT CTGCTGGCAT CAAGAGGTGG GAGGGCCCTC CGACCACTTC 4020 
CAGGGGAACC TGCCATGCCA GGAACCTGTC CTAAGGAACC TTCCTTCCTG CTTGAGTTCC 4080 
CAGATGGCTG GAAGGGGTCC AGCCTCGTTG GAAGAGGAAC AGCACTGGGG AGTCTTTGTG 4140 
GATTCTGAGG CCCTGCCCAA TGAGACTCTA GGGTCCAGTG GATGCCACAG CCCAGCTTGG 4200 
CCCTTTCCTT CCAGATCCTG GGTACTGAAA GCCTTAGGGA AGCTGGCCTG AGAGGGGAAG 4260 
CGGCCCTAAG GGAGTGTCTA AGAACAAAAG CGACCCATTC AGAGACTGTC CCTGAAACCT 4320 
AGTACTGCCC CCCATGAGGA AGGAACAGCA ATGGTGTCAG TATCCAGGCT TTGTACAGAG 4380 
TGCTTTTCTG TTTAGTTTTT ACTTTTTTTG TTTTGTTTTT TTAAAGACGA AATAAAGACC 4440 
CAGGGGAGAA TGGGTGTTGT ATGGGGAGGC AAGTGTGGGG GGTCCTTCTC CACACCCACT 4500 
TTGTCCATTT GCAAATATAT TTTGGAAAAC 4530 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCCGGGGGTC CTGGAAGCCA CAAGGTAAAC ACAACACATC CCCCTCCTTG ACTATGCAAT 60 

TTTACTAGAG GATGTGGTGG GAAAACCATT ATTTGATATT AAAACAAATA GGCTTGGGAT 120 

GGAGTAGGAT GCAAGCTCCC CAGGAAAGTT TAAGATAAAA CCTGAGACTT AAAAGGGTGT 180 

TAAGAGTGGC AGCCTAGGGA ATTTATCCCG GACTCCGGGG GAGGGGGCAG AGTCACCAGC 240 

CTCTGCATTT AGGGATTCTC CGAGGAAAAG TGTGAGAACG GCTGCAGGCA ACCCAGGCGT 300 

CCCGGCGCTA GGAGGGACGA CCCAGGCCTG CGCGAAGAGA GGGAGAAAGT GAAGCTGGGA 360 

GTTGCCGACT CCCAGACTTC GTTGGAATGC AGTTGGAGGG GGCGAGCTGG GAGCGCGCTT 420 

GCTCCCAATC ACAGGAGAAG GAGGAGGTGG AGGAGGAGGG CTGCTTGAGG AAGTATAAGA 480 

ATGAAGTTGT GAAGCTGAGA TTCCCCTCCA TTGGGACCGG AGAAACCAGG GGAGCCCCCC 540 

GGGCAGCCGC GCGCCCCTTC CCACGGGGCC CTTTACTGCG CCGCGCGCCC GGCCCCCACC 600 

CCTCGCAGCA CCCCGCGCCC CGCGCCCTCC CAGCCGGGTC CAGCCGGAGC CAT GGGGCC G 660 

GAGCCGCAGT GAGCACCATG GAGCTGGCGG CCTTGTGCCG CTGGGGGCTC CTCCTCGCCC 720 

TCTTGCCCCC CGGAGCCGCG AGCACCCAAG GTGGGTC 757 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CCCGGGGGTC CTGGAAGCCA CAAGGTAAAC ACAACACATC CCCCTCCTTG ACTATCAATT 60 

TTACTAGAGG ATGTGGTGGG AAAACCATTA TTTGATATTA AAACAAATAG GCTTGGGATG 120 

GAGTAGGATG CAAGCTCCCA GGAAAGTTTA AGATAAAACC TGAGACTTAA AAGGGTGTTA 180 

AGAGTGGCAG CCTAGGGAAT TTATCCCGGA CTCCGGGGGAGGGGGCAGAG TCACCAGCCT 240 

CTGCATTTAG GGATTCTCCG AGGAAAAGTG TGAGAACGGC TGCAGGCAAC CCAGCTTCCC 300 

GGCGCTAGGA GGGACGCACC CAGGCCTGCG CGAAGAGAGG GAGAAAGTGA AGCTGGGAGT 360 

TGCCACTCCC AGACTTGTTG GAATGCAGTT GGAGGGGGCG AGCTGGGAGC GCGCTTGCTC 420 

CCAATCACAG GAGAAGGAGG AGGTGGAGGA GGAGGGCTGC TTGAGGAAGT ATAAGAATGA 480 

AGTTGTGAAG CTGAGATTCC CCTCCATTGG GACCGGAGAA ACCAGGGAGC CCCCCCGGG 539 
(2) INFORMATION FOR SEQ ID NO:12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 
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(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GAATTCGGCA 


CGAGTACAGA 


AGGTAAAGGC 


TGTCTCTATG GAGCCACTGG 


CCATCCTGGT 


60 


GCTGCTGTGC 


TTTCCGATCT 


GCTCAGCATA 


TCCT CTGCAT GGGGCAGTGA 


GACAAGACCA 


120 


CTCAACCATG 


GATCTTGCTC 


AGCAATACCT 


AGAAAAATAC TACAACTTTA 


GAAAAAATGA 


180 


GAAACAATTT 


TTCAAAAGAA 


AGGACAGTAG 


TCCTGTTGTC AAAAAAATTG 


AAGAAATGCA 


240 


GAAGTTCCTT 


GGGCTGGAGA 


TGACAGGGAA 


GCTGGACTCG AACACTGTGG 


AGATGATGCA 


300 


CAAGCCCCGG 


TGTGGTGTTC 


CCGACGTTGG 


TGGCTTCAGT ACCTTTCCAG 


GTTCACCCAA 


360 


ATGGAGGAAA 


AACCACATCT 


CCTACAGGAT 


TGTGAATTAT ACACTGGATT 


TACCAAGAGA 


420 


GAGTGTGGAT 


TCTGCCATTG 


AGAGAGCTTT 


GAAGGTCTGG GAGGAGGTGA 


CCCCACTCAC 


480 


ATTCTCCAGG 


ATCTCTGAAG 


GAGAGGCTGA 


CATAATGATC TCCTTTGCAG 


TTGGAGAACA 


540 


TGGAGACTTT 


TACCCTTTTG 


ATGGAGTGGG 


ACAGAGCTTG GCTCATGCCT 


ACCCACCTGG 


600 


CCCTGGATTT 


TATGGAGATG 


CTCACTTCGA 


TGATGATGAG AAATGGTCAC 


TGGGACCCTC 


660 


AGGGACCAAT 


TTATTCCT GG 


TTGCTGCGCA 


TGAACTTGGT CACTCCCTGG 


GTCTCTTTCA 


720 


CTCAAACAAC 


AAAGAATCTC 


TGATGTACCC 


AGTCTACAGG TTCTCCACGA 


GCCAAGCCAA 


780 


CATTCGCCTT 


TCTCAGGATG 


ATATAGAGGG 


CATTCAATCC CTGTATGGAG 


CCCGCCCCTC 


840 


CTCTGATGCC 


ACAGTGGTTC 


CTGTGCCCTC 


TGTCTCTCCA AAACCTGAGA 


CCCCAGTCAA 


900 


ATGTGATCCT 


GCTTTGTCCT 


TTGATGCAGT 


CACCATGCTG AGAGGGGAAT 


TCCTATTCTT 


960 


TAAAGACAGG 


CACTTCTGGC 


GTAGAACCCA 


GTGGAATCCC GAGCCTGAAT 


TCCATTTGAT 


1020 


TTCAGCATTT 


TGGCCCTCTC 


TTCCTTCAGG 


CTTAGATGCT GCCTATGAGG 


CAAATAACAA 1080 


GGACAGAGTT 


CTGATTTTTA 


AAGGAAGTCA 


GTTCTGGGCA GTCCGAGGAA 


ATGAAGTCCA 


1140 


AGCAGGTTAC 


CCAAAGAGGA 


TCCACACTCT 


TGGCTTTCCT CCCACCGTGA 


AGAAGATTGA 


1200 


TGCAGCTGTT 


TTTGAAAAGG 


AGAAGAAGAA 


GACGTATTTC TTTGTAGGTG 


ACAAATACTG 


1260 


GAGATTTGAT 


GAGACAAGAC 


AGCTTATGGA 


TAAAGGCTTC CCGAGACTGA 


TAACAGATGA 


1320 


CTTCCCAGGA 


ATTGAGCCAC 


AAGTTGATGC 


TGTGTTACAT GCATTTGGGT 


TTTTTTATTT 


1380 


CTTCTGTGGA. 


TCATCACAGT 


TCGAGTTTGA 


CCCCAATGCC AGGACGGTGA 


CACACACACT 


1440 


GAAGAGCAAC 


AGCTGGCTGT 


TGTGCTGATT 


ATCATGATGA CAAGACATAT 


ACAACACTGT 


1500 


AAAATAGTAT 


TTCTCGCCTA 


ATTTATTATG 


TGTCATAATG ATGAATTGTT 


CCTGCATGTG 


1560 
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CTGTGGCTCG AGAT GAGCC C AGCAGATAGA TGTCTTTCTT AATGAACCAC AGAGCATCAC 1620 

CTGAGCACAG AAGTGAAAGC TTCTCGGTAC ACTAGGTGAG AGGATGCATC CCCATGGGTA 1680 

CTTTATTGTT TAATAAAGAA CTTTATTTTT GAACCAT 1717 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



GATATCAAGA 


GGGTGATGCA 


AACGTCCCAG GAGTGTTCAA GATAAAACCG 


GAGACTGCAA 


60 


AGACGGGTAA AGGGATGCTG 


TGCTTTTAGG AAGTGGATGA GAACTGCAAG CAAGCAAGCA 


120 


AGCAAGCAAG 


CAAGCAAGCA 


AGCAAGCAAG CAAGCAAGCT AGGCGTCGGG GCACAGGGCA 


180 


GGCGCACCCA 


GGCCTGCGCC 


GGGAGGGAGA AAGTGAAAGC TGGGAGCAGC 


CACTCCCAGT 


240 


CTTGCTGGAA 


TGCAGTTGGA 


GGGGTGGGGG GGCGAGCCGA GAGCGCGCGG 


CTGCCAATCA 


300 


CGGGCGGAGG AGGAGGTGGA 


GGAGGAGGGC TGCTCGAGGA AGTGCGGCGT 


GAAGTTGTGG 


360 


AGCTGAGATT 


GCCCGCCGCT 


GGGGACCCGG AGCCCAGGAG CGCCCCTTCC 


CAGGCGGCCC 


420 


CTTCCGGCGC 


CGGCCTGTGC 


CTGCCCTCGC CGCGCCCCCC GCGCCCGCAG 


CCTGGTCGAG 


480 


CCTGAGCCAT 


GGGGCCGGAG 


CCGCAATGAT CATCATGGAG CTGGCGGCCT 


GGTGCCGCTG 


540 


GGGGTTCCTC 


CTCGCCCTCC 


TGCCCCCCGG AATCGCGGGC ACCCAAGGTG 


GGTCTTGGCT 


600 


TGGGAAGGGC 


TCTGGCCGCT 


GTGCTGCCCA CGGGCCGGAG CGCGGAGCTC 




650 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3955 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CCGGGCCGGA GCCGCAATGA TCATCATGGA GCTGGCGGCC TGGTGCCGCT GGGGGTTCCT 60 

CCTCGCCCTC CTGCCCCCCG GAATCGCGGG CACCCAAGTG TGTACCGGCA CAGACATGAA 120 

GTTGCGGCTC CCTGCCAGTC CTGAGACCCA CCTGGACATG CTCCGCCACC TGTACCAGGG 180 

CTGTCAGGTA GTGCAGGGCA ACTTGGAGCT TACCTACGTG CCTGCCAATG CCAGCCTCTC 240 

ATTCCTGCAG GACATCCAGG AAGTTCAGGG TTACATGCTC ATCGCTCACA ACCAGGTGAA 300 

GCGCGTCCCA CTGCAAAGGC tGCGCATCGT GAGAGGGACC CAGCTCTTTG AGGACAAGTA 360 

TGCCCTGGCT GTGCTAGACA ACCGAGATCC TCAGGACAAT GTCGCCGCCT CCACCCCAGG 420 
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CAGAACCCCA GAGGGGCTGC GGGAGCTGCA 
AGGAGTTTTG ATCCGTGGGA ACCCTCAGCT 
CGTCTTCCGC AAGAATAACC AACTGGCTCC 
CTGTCCACCT TGTGCCCCCG CCTGCAAAGA 
CTGTCAGATC TTGACTGGCA CCAT CTGTAC 
GCCCACTGAC TGCTGCCATG AGCAGTGTGC 
CTGCCTGGCC TGCCTCCACT TCAATCATAG 
CGTCACCTAC AACACAGACA CCTTTGAGTC 
TGGTGCCAGC TGCGTGACCA CCTGCCCCTA 
CACTCTGGTG TGTCCCCCGA ATAACCAAGA 
TGAGAAATGC AGCAAGCCCT GTGCTCGAGT 
AGGGGCGAGG GCCATCACCA GTGACAATGT 
TGGGAGCCTG GCATTTTTGC CGGAGAGCTT 
GCTGAGGCCT GAGCAGCTCC AAGTGTTCGA 
CATCTCAGCA TGGCCAGACA GTCTCCGTGA 
TCGGGGACGG ATTCTCCACG ATGGCGCGTA 
CTCGCTGGGG CTGCGCTCAC TGCGGGAGCT 
CGCCCATCTC TGCTTTGTAC ACACTGTACC 
GGCCCTGCTC CACAGTGGGA ACCGGCCGGA 
CTGTAACTCA CTGTGTGCCC ACGGGCACTG 
CTGCAGTCAT TTCCTTCGGG GCCAGGAGTG 
CCCCCGGGAG TATGTGAGTG ACAAGCGCTG 
AAACAGCTCA GAGACCTGCT TTGGATCGGA 
CAAGGACTCG TCCTCCTGTG TGGCTCGCTG 
CATGCCCATC TGGAAGTACC CGGATGAGGA 
CACCCACTCC TGTGTGGATC TGGATGAACG 
GGTGACATTC ATCATTGCAA CTGTAGAGGG 
CGTTGGAATC CTAATCAAAC GAAGGAGACA 
GCTGCAGGAA ACTGAGTTAG TGGAGCCGCT 
TCAGATGCGG ATCCTAAAAG AGACGGAGCT 
TTTTGGCACT GTCTACAAGG GCATCTGGAT 
GGCTATCAAG GTGTTGAGAG AAAACACATC 
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GCTT C GAAGT 


CTCACAGAGA 


TCCTGAAGGG 


480 


CTGCTACCAG 


GACATGGTTT 


TGTGGAAGGA 


540 


TGTCGATATA 


GACACCAATC 


GTTCCCGGGC 


600 


CAATCACTGT 


TGGGGTGAGA 


GTCCGGAAGA 


660 


CAGTGGTTGT 


GCCCGGTGCA 


AGGGCCGGCT 


720 


CGCAGGCTGC 


ACGGGCCCCA 


AGCATTCTGA 


780 


TGGTATCT GT 


GAGCTGCACT 


GCCCAGCCCT 


840 


CATGCACAAC 


CCTGAGGGTC 


GCTACACCTT 


900 


CAACTACCT G 


TCTACGGAAG 


TGGGATCCTG 


960 


GGTCACAGCT 


GAGGACGGAA 


CACAGCGTTG 


1020 


GTGCTATGGT 


CTGGGCATG6 


AGCACCTTCG 


1080 


CCAGGAGTTT 


GATGGCTGCA 


AGAAGATCTT 


1140 


TGATGGGGAC 


CCCTCCTCCG 


GCATTGCTCC 


1200 


AACCCTGGAG 


GAGATCACAG 


GTTACCTGTA 


1260 


CCTCAGTGTC 


TTCCAGAACC 


TTCGAATCAT 


1320 


CTCATTGACA 


CTGCAAGGCC 


TGGGGATCCA 


1380 


GGGCAGTGGA 


TTGGCTCTGA 


TTCACCGCAA 


1440 


TTGGGACCAG 


CTCTTCCGGA 


ACCCACATCA 


1500 


AGAGGACTTG 


TGCGTCTCGA 


GCGGCTTGGT 


1560 


CTGGGGGCCA 


GGGCCCACCC 


AGTGTGTCAA 


1620 


TGTGGAGGAG 


TGCCGAGTAT 


GGAAGGGGCT 


1680 


TCTGCCGTGT 


CACCCCGAGT 


GTCAGCCTCA 


1740 


GGCTGATCAG 


TGTGCAGCCT 


GCGCCCACTA 


1800 


CCCCAGTGGT 


GTGAAACCGG 


ACCTCTCCTA 


1860 


uuvUuiuuU 




CGAXGAACTG 


1920 


AGGCTGCCCA 


GCAGAGCAGA 


GAGCCAGCCC 


1980 


CGTCCTGCTG 


TTCCTGATCT 


TAGTGGTGGT 


2040 


GAAGATCCGG 


AAGTATACGA 


TGCGTAGGCT 


2100 


GACGCCCAGC 


GGAGCAATGC 


CCAACCAGGC 


2160 


AAGGAAGGTG 


AAGGTGCTTG 


GATCAGGAGC 


2220 


CCCAGAXGGG 


GAGAATGTGA 


AAATCCCCGT 


2280 


TCCTAAAGCC 


AACAAAGAAA 


TTCTAGATGA 


2340 
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AGCGTATGTG 




TGGGTTCTCC 


GTATGTGTCC 


CGCCTCCTGG 


GCATCTGCCT 


2400 






TGACACAGCT 


TATGCCCTAC 


GGCTGCCTTC 


TGGACCATGT 


2460 






TAGGCTCCCA 


GGACCTGCTC 


AACTGGTGTG 


TTCAGATTGC 


2520 






AGGACGTGCG 


GCTTGTACAC 


AGGGACCTGG 


CTGCCCGGAA 


2580 


TGTGCTAGTC 




ACCACGTCAA GATTACAGAT 


TTCGGGCTGG 


CTCGGCTGCT 


2640 


buAwUTLbAl 


r^TA r^TA rm < ■•!• 


ACCATGCAGA TGGGGGCAAG 


GTGCCCATCA 


AATGGATGGC 


2700 






GCCGGTTCAC 


CCATCAGAGT 


GATGTGTGGA 


GCTATGGAGT 


2760 


GACTGTGTGG 


iwibL 1 wtM.1 LxH 


CTTTTGGGGC 


CAAACCTTAC 


GATGGAATCC 


CAGCCCGGGA 


2820 




TTGCTGGAGA 


AGGGAGAACG 


CCTACCTCAG 


CCTCCAATCT 


GCACCATTGA 


2880 


TGTCTACATG 


ATTATGGTCA 


AATGTTGGAT 


GATTGACTCT 


GAATGTCGCC 


CGAGATTCCG 


2940 


GGAGTTGGTG 


TCAGAATTTT 


CACGTATGGC 


GAGGGACCCC 


CAGCGTTTTG 


TGGTCATCCA 


3000 


GAACGAGGAC 


TTGGGCCCAT 


CCAGCCCCAT 


GGACAGTACC 


TTCTACCGTT 


CACTGCTGGA 


3060 


AGATGATGAC 


ATGGGTGACC 


TGGTAGACGC TGAAGAGTAT 


CTGGTGCCCC 


AGCAGGGATT 


3120 


CTTCTCCCCG 


GACCCTACCC 


CAGGCACTGG 


GAGCACAGCC 


CATAGAAGGC 


ACCGCAGCTC 


3180 


GTCCACCAGG 


AGTGGAGGTG 


GTGAGCTGAC ACTGGGCCTG 


GAGCCCTCGG 


AAGAAGGGCC 


3240 


CCCCAGATCT 


CCACTGGCTC 


CCTCGGAAGG 


GGCTGGCTCC 


GATGTGTTTG 


ATGGTGACCT 


3300 


GGGAATGGGG 

bWbu 1 ALHbL 


GTAACCAAAG 


GGCTGCAGAG 
CATTACCTCT 


CCTCTCTCCA 
GCCCCCCGAG 


CATGACCTCA 
ACT GATGGCT 


GCCCTCTACA 
ATGTTGCTCC 


3360 
3420 






CCGAGTATGT 


GAACCAATCA 


GAGGTTCAGC 


ctcagcctcc 


3480 






TGCCTCCTGT 


CCGGCCTGCT 


GGTGCTACTC 


TAGAAAGACC 


3540 


CAAGACTCTC 


TCTCCTGGGA 


AGAATGGGGT 


TGTCAAAGAC 


GTTTTTGCCT 


TCGGGGGTGC 


3600 


TGTGGAGAAC 


CCTGAATACT 


TAGTACCGAG AGAAGGCACT 


GCCTCTCCGC 


CCCACCCTTC 


3660 


TCCTGCCTTC 


AGCCCAGCCT 


TTGACAACCT 


CTATTACTGG 


GACCAGAACT 


CATCGGAGCA 


3720 


GGGGCCTCCA 


CCAAGTAACT 


TTGAAGGGAC 


CCCCACTGCA 


GAGAACCCTG 


AGTACCTAGG 


3780 


CCTGGATGTA 


CCTGTATGAG 


ACGTGTGCAG ACGTCCTGTG 


CTTTCAGAGT 


GGGGAAGGCC 


3840 


TGACTTGTGG 


TCTCCATCGC 


CACAAAGCAG 


GGAGAGGGTC 


CTCTGGCCAC 


ATTACATCCA 


3900 


GGGCAGACGG 


CTCTACCAGG 


AACCTGCCCC 


GAGGAACCTT 


TCCTTGCTGC 


TTGAA 


3955 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
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\SrlXx\X 


MuHuiuiiw wiwiw\i*i-Ho iiMuMWiinn. wVCATTCCCT TCCCAGGCTG 


60 


•rt A 1. X AAVV«»V»» X \S 


AiVjvaM.iV»x iimwwjA 1 iAAuACTGT GATGCAAACG 


120 


TTrpaacn CI* 

1 A W^n/vtUtrluX 


BTrraLftaTa zii»iif*£*r?Jvccf* uiifii\f* r Pft — nii nrn/»vr<m«n »v m^«_m M m«. 
AiLUViutHiA /w\k*^w\fc*w^ M/usHwxbWvV AGAGGGGTAA AGAGATGCCC 


180 




Mhuiwwiwi wviuiowviw tHHvUviouA AGCuAGGCGT CAGGGGAGAG 


240 




lrWU7l*l»X uWty ^^*W»\3*W»watr* unn/VviuHHb CTGGGAGGAG CGACTCCCAG 


300 


VCFTGCFGGA 


A(3T*C2VCI"Pflf5 Gf£f*PRIi(Vr"R fZfTi rir*r±r*r*r*r+ rvwrn^wMMvi 

AUA^*n\9XXW *>VjwWjX VjVJVjvj V7V7V7x^Ut/AV9^^V7 wwiwVUCGVw GCxTwCGAATC 


360 


ACGGGCGGCG 


GAGGAGGCGG AGGAGGAGGG CTGCTCGAGG HJLtZTfZftiizr'f t*r*iv n r*mmr*i*r* 


420 


GAGCTGAGAT 


TGCCCGCCGC TGGGGACCCG GAGCCCAGGA GCGCCCCTTC CCAGGCGGCC 


480 


CCTTCCGGCG 


CCGCGCCTGT GCCTGCCCTC GCCGCGCCCC GGCCCGCAGC CTGGTCGAGC 


540 


CTGAGCCATG 


GGGCCGGAGC CGCAGTGATC ATCATGGAGC TGGCGGCCTG GTGCCGTTGG 


600 


GGGTTCCTCC 


TCGCCCTCCT GTCCCCCGGA GCCGCGGGTA CCGAAGGTGG GTCTTGGCTT 


660 


GGGGAGGGCT 


CGGGCCGCTA CGCTGCCCAC GGCGGCCGGA GCCGCGGGGC CCCGAGAGCT 


720 


C 




721 
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What is claimed is: 

1 . A purified protein designated HPBF which binds to the promoter region of the 
ERBB2 gene and has a molecular weight of about 44,000-47,000 daltons as determined 
by sodium dodecjd sulfite polyacrylamide gel electrophoresis under reducing conditions 
and which comprises the amino add sequence of SEQ ID NOS : 1 and 2. 

2. A purified antibody which specifically binds the protein of Claim 1 . 

3. The antibody of Claim 2, wherein the antibody is conjugated to a therapeutic 
drug. 

4. The antibody of Claim 2, wherein the antibody is conjugated to a detectable 
moiety. 

5. the antibody of Claim 2, wherein the antibody is bound to a solid support. 

6. A bioassay for determining the amount of HPBF in a biological sample 
comprising: 

a) contacting the biological sample with a nucleic add to which the HPBF 
binds under conditions such that an HPBF/nucldc add complex can be formed; and 

b) determining the amount of the HPBF/nucldc add complex, the amount 
of the complex indicating the amount of HPBF in the sample. 

7. The bioassay of Claim 6, wherein the nucleic add is the nuddc add set forth in 
SEQIDNO:3. 

8. A bioassay for determining the amount of HPBF in a biological sample 
comprising: 

a) contacting the biological sample with an antibody under conditions such 
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that a specific complex of the antibody and HPBF can be formed; and 

b) determining the amount of the antibody/HPBF complex, the amount of 
the complex indicating the amount of HPBF in the biological sample. 

9. A method of detecting the presence of a cancer in a subject comprising 
determining the presence of a detectable amount of HPBF in a biopsy from the subject, 
the presence of a detectable amount of HPBF relative to the absence of HPBF in a 
normal control indicating the presence of a cancer. 

10. A method of determining the prognosis of a subject having cancer comprising 
determining the presence of a detectable amount of HPBF in a biopsy from the subject, 
the presence of a detectable amount of HPBF relative to the absence of HPBF in a 
normal control indicating a decreased chance of long-term survival. 

11. A DNA isolate encoding the protein of Claim 1 . 

12. A bioassay for screening substances for the ability to inhibit the activity of HPBF 
comprising: 

a) administering the substance to a cell construct comprising: 
0 

the promoter region of ERBB2 linked to a reporter gene; and 
ii) 

an activated gene encoding HPBF; 

b) determining the amount of the reporter gene product; and 

c) selecting those substances which inhibit the expression of the reporter 
gene product. 

13. A bioassay for screening substances for the ability to inhibit the mhogenic 
activity of HPBF in NIH3T3 cells, comprising: 

a) administering the substance to the cells; 

b) administering HPBF to the cells; 
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c) determining the autogenic activity of HPBF in th substance-treated 
cells; and 

d) selecting those substances which inhibit the mitogenic activity of HPBF 
in the cells. 

14. A bioassay for screening substances for the ability to inhibit the production of 
HPBF, comprising: 

a) administering the substance to a cell having an activated gene encoding 

HPBF; 

b) determining the amount of HPBF produced; and 

c) selecting those substances which inhibit the production of HPBF. 

15. A method of inhibiting a biological activity mediated by HPBF comprising 
preventing the HPBF from binding to the promoter region of the ERBB2 gene 
sequence. 

16. The method of Claim 15, wherein the binding to the promoter region is 
prevented by an antisense nucleotide sequence. 

17. The method of Claim 1 5, wherein the binding to the promoter region is 
prevented by a nongenomic nucleic add sequence to which the HPBF binds. 
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5\ . TACGAATGAAGTTGTGAAGCTGAGATTCCCCTCC. 3' 
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